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Reform the Antarctic Treaty 


Political protection for the planet’s last great wilderness is no longer fit for purpose. Make its 
governance democratic: scrap the veto that lets individual interests rule. 


polar region, ‘pristine’ is among the most inappropriate. The 

ocean around Antarctica bobs with pieces of microplastic pol- 
lution, and for decades, whales and other marine life have been stripped 
from the sea. The ozone hole gapes above. To find any of the advertised 
unspoiled wilderness, a visitor has to trek inshore and away from the 
direct influence of the rest of the world. Because there is another misap- 
plied label: remote. It might look isolated on a map, but the Antarctic is 
within increasingly convenient reach — for good and for bad. 

Campaigners last week said that Antarctic snow samples they had 
gathered were polluted with persistent hazardous chemicals. And 
figures presented at the annual meeting of parties to the Antarctic 
Treaty last month in Buenos Aires showed that 45 private yachts were 
spotted in sensitive Antarctic waters — or reported an intention to 
visit — over the most recent southern summer season. That's up by 
one-third on the previous year. Nine did so without permission, and 
crew and passengers on at least one were seen to violate strict protec- 
tions by approaching birds’ nests, flying drones through rookeries and 
touching animals. 

The tasks of setting rules to control all this behaviour and working 
to protect the continent from (further) harm fall heavily on delegates 
from dozens of countries who attend those Antarctic Treaty meetings. 
And the Buenos Aires gathering did have some success, drafting new 
rules on drone use and settling other minor issues. But when it comes 
to measures to address the bigger challenges, not least how to conserve 
fish and the other marine life that survives in the Southern Ocean, the 
treaty is at the mercy of geopolitics — and there are worrying signs 
that it is struggling to cope. 

Nature this week publishes a series of articles discussing this and 
other issues that are emerging in Antarctica. An Insight supplement 
explores the mechanisms that control the movement of Antarctica’s 
ice and interactions with the broader climate system (see page 199). 
A Comment describes the perilous position of Antarctic fisheries 
(see page 177). And a World View (see page 163) makes the case for 
stronger collaboration and sharing of research infrastructure between 
scientists from different countries who work in Antarctica. 

Pressure on the Antarctic Treaty from geopolitics can only increase, 
as demand for the continent’s stocks of fish and expected reserves of 
minerals rises with the depletion of resources elsewhere. Formally, 
the treaty prohibits mining until at least 2048. But the protection that 
the agreement offers increasingly relies on good will as much as inter- 
national law, as shown by the determination of some nations to con- 
tinue fishing in Antarctic waters, in the face of proposals to ban the 
practice by establishing reserves and protected zones. 

In one sense, it’s difficult for science to lament the unwelcome intru- 
sion of international politics into its Antarctic playground. The 1959 
creation of the Antarctic Treaty — which fenced the territory off for 
research — was itself an act of supreme realpolitik, a way for a resur- 
gent post-war United States to gain influence on a continent it had 


() f the common adjectives used to describe Earth’s southern 


previously ignored. In agreeing to set aside their overlapping territorial 
claims — for example, the United Kingdom, Chile and Argentina all 
contest the same region — the other founding nations also found the 
treaty convenient, because they could put off any attempts to resolve 
their disputes without losing face. It’s partly for this reason that the 
treaty is suggested as a model of possible future governance in regions 

of tension elsewhere, such as the South China Sea. 
But although science has been the face of national expansion into 
the Antarctic in the decades since the treaty began, strategic interests 
have always been close behind. And now that 


“The treaty is they are threatening to take the lead, what 
at the mercy can science do to protect its own needs in the 
of geopolitics region — as well as the planet’s last remain- 
— and there ing chunk of relatively unspoiled land, in the 
are worrying Antarctic interior? 

signs that it is First, the scientific community should 
struggling to recognize the scale of the challenge. The 
cope. ” Antarctic Treaty has stood for a long time 


and although it might look solid, it is fragile 
and vulnerable to special interests — a bit like Antarctica itself. Some 
53 nations now contribute to the governance of Antarctica through the 
treaty system, and notasa democracy. Individual countries can veto 
measures they dislike, allowing them to continue activities that the 
majority wish to outlaw, which is one reason why the system has not 
produced any new binding protocols (measures to enforce the treaty’s 
principles) for two decades. Just as the original countries agreed to 
postpone arguments over their national claims when they signed the 
treaty, current members prefer to kick difficult decisions down the 
road. But it’s often those decisions — such as how to punish nations 
that break the rules — that scientists and other Antarctic advocates 
care about the most. 

That’s not to say that science has lost its voice. If anything, the 
opposite is true. An entire generation across the world has grown 
up seeing science as a priority for Antarctica under the treaty. And 
most of those people would surely object to the idea of science and 
conservation being tossed aside so that the Antarctic wilderness can 
be fished, mined, polluted or developed. Scientists can strengthen 
and harness such support by relentlessly telling the public and 
policymakers about the seriousness of the threat to Antarctica and 
the need to protect the region. The stakes are high: this really is the 
last chance we have to leave a piece of the planet close to the way 
we found it. 

Ultimately, changes in governance will probably be required to 
maintain the primacy of science over exploitation in Antarctica, and to 
minimize the environmental damage that changes to the polar region 
will cause to the rest of the planet. The Antarctic Treaty was a triumph 
of global politics, but global politics has changed. Even though any 
voting system is subject to gaming, the time of the single-country veto 
has passed. Let the future of Antarctica be decided by the majority. = 
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Chinese checkers 


China sets a strong example on how to address 
scientific fraud. 


science budget — the world’s second-largest by country — goes 

to waste on bad science. It doesn’t want to waste any more. 
On 30 May, the State Council and the Communist Party of China 
announced a radical new system of regulations to police science and 
raise research standards in the country. 

Certainly, reform is necessary and overdue. Various Chinese govern- 
ment bodies have made the case to crack down on fraud and misconduct 
in science over the past two decades, but with limited success. This time, 
the changes have serious political weight behind them and could make 
asignificant difference. The policy might offer the greatest disincentive 
to cheating in research that the world has seen so far. But the devil, as 
always, will be in the detail — and in how well the plans are enforced. 

One of the most striking conditions is that researchers will be 
deterred from publishing findings in journals that China deems to be 
of poor academic quality, poorly managed and set up merely for profit. 
Many such ‘predatory journals’ offer researchers a place to publish, for 
a fee, and shirk their editorial responsibility to evaluate papers to deter- 
mine quality. China’s science ministry is working on a blacklist of those 
journals. In an unprecedented step, any researcher who publishes in 
one will get a warning and be given no credit for the publication when 
they are evaluated for grants or jobs. Using government grants to pay 
the publication fees in these journals, as many presumably do, could 
land Chinese scientists in deeper trouble. 

As the world’s largest producer of scientific papers, China’s new rules 
could go as far as to put some of these rogue journals out of business, 
and that could be good for scientists everywhere. (Although, as we dis- 
cuss in a News story this week on page 171, some scientists are anxious 
about how these journals are identified, while others have concerns 


r | Vhe Chinese government knows that a slice of its generous 


about such blacklists and prefer ‘whitelists’ of approved publications.) 

In another major shift, China is handing the responsibility for 
deterring and investigating scientific misconduct to the government's 
science ministry. That’s quite a shake-up for China, where — as in 
many places — institutions are usually expected to investigate allega- 
tions against their own researchers. That is too often ineffective. With 
little to gain and a reputation to lose, many prefer to sit on their hands 

and wait for the situation to blow over. 


“The policy Denmark, for instance, has designated a 
mig ht offer national agency to police science, but, too 
the greatest frequently, there is limited will and scant 
disincentive resources to pursue allegations of fraud 
to cheating in at the government level. In the United 
researchthatthe States, for example, the Office of Research 


Integrity is short-staffed and has limited 
leverage over universities. 

In China, the situation could play out differently. The new rules 
state explicitly that institutions that shield errant scientists can be 
punished through a loss of funding. That could give the policy real 
teeth — enough to drastically clean up Chinese research. But suc- 
cess will take sustained effort and pressure from the top, and because 
there is no guarantee of that, the policy could equally fall flat. China’s 
bureaucrats are not responsive to its citizens — no matter how loud the 
cry on social media for an investigation into a given scientist — and 
they make almost no effort to be transparent. The science ministry 
could stick its head in the sand just as deeply as some institutions do. 

There are other causes for concern. The science ministry is also 
drawing up rules on how penalties will be meted out — including the 
blacklisting of scientists who have committed particularly egregious 
acts. To maintain fairness, harsh penalties require assurances that the 
judgements leading to them are based on thorough and fair evaluations. 

China's bureaucrats might not answer to the people, but they do 
answer to the higher echelons of power. The current push for better 
management of science comes as part of President Xi Jinping’s wider 
anti-corruption drive. Xi regularly talks up the crucial role of science 
and technology in making China stronger and more independent. 
With its new rules, China is backing words with actions. m 


world has seen.” 


Not necessary 


Phrase ‘necessary and sufficient’ blamed for 
flawed neuroscience. 


Orwell argued that “if thought corrupts language, language can 

also corrupt thought”. Can the same be said for science — that the 
misuse and misapplication of language could corrupt research? Two 
neuroscientists believe that it can. In an intriguing paper published 
in the Journal of Neurogenetics, the duo claims that muddled phrasing 
in biology leads to muddled thought and, worse, flawed conclusions 
(M. Yoshihara and M. Yoshihara J. Neurogenet. 32, 53-64; 2018). 

The phrase in the crosshairs is “necessary and sufficient”. It’s 
a popular one: figures suggest the wording pops up in some 
3,500 scientific papers each year across genetics, cell biology and 
neuroscience alone. It’s not a new fad: Nature’s archives show 
consistent use since the nineteenth century. 

Used properly, the phrase indicates a specific relationship between 
two events. For example, the statement, “Ill pay for lunch if, and only 
if, you pay for breakfast,” can be written as, “You paying for breakfast 
is necessary and sufficient for me paying for lunch” 

But, argue Motojiro Yoshihara and Motoyuki Yoshihara, use of the 
phrase in research reports is problematic, and should be curtailed. 

The logic of the term is at the heart of the dispute. It’s too often used 
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as shorthand to mean ‘linked to’ or ‘important for, the authors say. And 
this sloppy use, they argue, can lead scientists in the wrong direction, 
especially in genetics. 

Ifa gene is necessary and sufficient for something (as often claimed), 
strict logic demands that that gene alone can do the job. For example, 
the gene eyeless is certainly necessary for a retina to develop. But it is 
not sufficient — if it were, then logic would demand that ‘if eyeless 
exists, then a retina will develop. This is false; other genes and factors 
are needed as well. Yet eyeless is often described as being necessary and 
sufficient for retinal development. 

The duo argues that its objection to such incorrect use is more 
than pedantry. The combination of necessary and sufficient is exces- 
sively strict, and its widespread use has meant, for example, that some 
‘command’ neurons have failed to be identified as such because they 
dontt satisfy the required criteria. (The agreed definition of acommand 
neuron is one that is necessary and sufficient to initiate a behaviour.) 

One such missed neuron is the Mauthner cell, responsible for a 
fast-escape reflex in fishes and amphibians. In fact, so few command 
neurons satisfy the logic of the phrase that the concept that they exist 
at all has been undermined, the authors say. 

In most cases, they propose, a better phrase than ‘necessary and 
sufficient would be “indispensable and inducing”. (Number of uses 
so far: one, in their paper.) 

Will it catch on? Biologists will no doubt counter that they use the 
‘necessary and sufficient’ phrase in a mutually understood way that 
is separate from its logical roots. Perhaps, but then Orwell had that 
covered, too: “A bad usage can spread by tradition and imitation even 
among people who should and do know better.” = 
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idea of a continent free of discord and set aside for science. 

In 1959, nations pledged to give up territorial claims on the 
region, to use the continent only for peaceful purposes, to cooperate 
on scientific investigations and to freely share results. An addendum 
called the Environmental Protocol to protect the region and prohibit 
mining came into force in 1998. No other major agreements under the 
treaty have been enforced since (see page 161). 

Meanwhile, research infrastructure is booming. Nations including 
China, Germany, South Korea, New Zealand and the United Kingdom 
have been building new research stations or upgrading existing ones. 
Australia, Norway and the United Kingdom have launched or will soon 
launch research vessels much larger than their predecessors. In May, 
Australia announced that it would, subject to environmental approvals, 
build the first paved runway on the continent, a 2.7-kilometre airstrip 
that will open up East Antarctica to large aircraft 
all year round. Several other countries are also 
upgrading their air access. 

Why all the interest? Understanding the region 
is essential to predicting global environmen- 
tal change, particularly sea-level rise — with its 
implications for people and property. And there 
are geopolitical considerations, too. More coun- 
tries want to maintain a presence below the 60th 
parallel south, especially as the Southern Ocean 
becomes more accessible to fishing (see page 177). 

But too few nations are building the intellectual 
capacity to complement the accumulating infra- 
structure. They should be looking for ways to 
share research resources and coordinate efforts. 
Diffuse activity will not provide the kind of information that world 
leaders can act on. It is time to focus. 

The Scientific Committee on Antarctic Research (SCAR), which I 
head, was established almost at the same time as the treaty, and acts 
as a scientific adviser for the region. It already coordinates activities 
ranging from marine-predator surveys to space-weather forecasts, 
which provide geomagnetic-storm warnings that are necessary to 
secure electricity grids and satellites. 

SCAR is often asked for advice on everything from bioprospecting 
to the potential influence of drones on wildlife, but is rarely funded 
to provide it. Countries that readily invest millions of dollars in infra- 
structure often struggle to find funds for policy-relevant science. They 
would get a much better return on their investment if they put a tiny 
fraction of infrastructure spending towards coordinated research. 

The remit of SCAR should be formally expanded to coordinate 
resources more broadly and to formulate scientific questions whose 
answers could shape global agreements. One crucial question is how 
long the Southern Ocean will continue to take up carbon as its waters 
warm and acidify. Another is how changing krill populations will 
alter ecosystems, which include iconic predators such as penguins, 
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Polar collaborations are 
key to successful policies 


Expand the remit of the Scientific Committee on Antarctic Research to 
coordinate the influx of infrastructure, says its president, Steven L. Chown. 


seals and whales, and threaten fishing grounds. 

The most urgent task is to understand the Antarctic ice sheets well 
enough to reduce uncertainties about sea-level change. More research 
is needed into how ice shelves buttress ice sheets, how the ocean, ice 
shelf and atmosphere interact, how melting water fractures ice shelves 
and how snowfall on the continent is changing. 

Clarity on these fronts is needed to hold signatories of the Paris 
climate agreement to their promises to reduce greenhouse-gas emis- 
sions and to help adapt to coming changes. Without good models, much 
planning effort will be wasted. In fact, without better information from 
the Antarctic, global human populations near the coasts will not be able 
to work out how to accommodate higher sea levels and more-frequent 
storm surges. Failure to plan appropriately will make it harder to act not 
just on climate change but also on migration, justice and conservation. 

Gaining the needed clarity will require shared infrastructure and 
coordination. This could take the form of joint 
research cruises to collect sediment and ice cores, 
sub-ice-shelf investigations by autonomous 
vehicles and modelling to improve estimates of 
changes in local ice-shelf and ice-sheet behaviour 
across the Antarctic. 

In April, the United Kingdom and United 
States announced they would work together to 
study the broad, fast-moving Thwaites Glacier 
flowing into the Amundsen Sea. Other research- 
ers could join up throughout the region, and 
SCAR could facilitate and support more such 
efforts, align modelling and empirical data gath- 
ering and ensure that data and infrastructure, 
such as ships and stations, are readily available. 

SCAR has a track record of facilitating successful collaborations. 
These include the discovery of the ozone-layer hole and elucidation of 
its chemistry, and clarification of the relative importance of food-web 
paths in Southern Ocean ecosystems. Doing such science is, however, 
quite different from ensuring that its results inform policy. 

Happily, SCAR has another unique capability, one that drew me 
to the organization. The committee is a designated adviser to the 
parties to the Antarctic Treaty, and their delegations are well placed 
to influence decisions about the region and its neighbours. SCAR 
can also advise global bodies such as the Intergovernmental Panel 
on Climate Change, the United Nations Framework Convention on 
Climate Change and the Convention on Biological Diversity. 

In other words, SCAR is poised to coordinate research effectively 
and to convey its findings to the bodies that are best able to act on the 
information. Parties to the treaty should seize this unique opportunity 
to support science that can bring better decisions for the planet. m 


Steven L. Chown is professor of biological sciences at Monash 
University in Melbourne, Australia, and president of SCAR. 
e-mail: steven.chown@monash.edu 
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Astronaut minister 


Astronaut Pedro Duque was 
appointed Spain’s minister 

of science, innovation and 
universities on 7 June. The 
country’s socialist party 
formed a new minority 
government after a vote of 

no confidence ousted the 
previous prime minister. 
Spain’s government science 
budget plunged by more than 
30% after the 2008 financial 
crisis, and scientists say that 
austerity measures have added 
bureaucracy around research 
expenses. The government has 
committed to sticking with 

an austerity budget for 2018, 
but Duque says that he will try 
to ease some cost-controlling 
rules that delay laboratory 
purchases and the spending of 
grants. Duque was Spain’s first 
astronaut, with the European 
Space Agency. He flew on the 
space shuttle Discovery in 1998 
and to the International Space 
Station in 2003, where he ran 
science experiments. 


US science envoys 


A former astronaut is among 
five science envoys announced 
by the US Department of State 
on 8 June. In a programme 
begun under former 

US president Barack Obama, 
the envoys travel the world 

to strengthen international 
cooperation in science and 
technology. The new class 
comprises former NASA chief 
Charles Bolden, who flew 
aboard four space-shuttle 
missions; Robert Langer, 

a chemical engineer at the 
Massachusetts Institute of 
Technology in Cambridge; 
Michael Osterholm, an 
infectious-disease researcher 
at the University of Minnesota 
in Minneapolis; Rebecca 
Richards-Kortum, a 
bioengineer at Rice University 
in Houston, Texas; and James 
Schauer, who studies air 
quality at the University of 


Africa’s iconic baobab trees are dying 


The oldest and largest baobab trees in Africa 

are dying — but no one knows why. The iconic 
African baobab tree (Adansonia digitata), 

which sports a wide trunk and high branches, 

is Earth’s oldest living flowering plant, and it 

can live for 2,000 years. In a study intended to 
investigate the age, structure and longevity of 
Africa's largest and potentially oldest baobabs, 
researchers were surprised to find that several of 


Wisconsin—Madison. Such 
appointments are rare for the 
administration of current 

US President Donald Trump, 
which has appointed very few 
science advisers. 


Verma resigns 

Cancer researcher Inder Verma 
has resigned from the Salk 
Institute for Biological Sciences 
in La Jolla, California, the 
institute said on 11 June. 
Verma had been temporarily 
suspended since 21 April, after 
several female scientists with 
ties to the institute alleged 

that he had sexually harassed 
and discriminated against 
them. The Salk Institute then 
began an investigation of the 
complaints, which Verma has 
repeatedly denied, and on 
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11 June its board of trustees 
met to discuss the findings 

of that probe. According to 
the institute, Verma resigned 
before the Salk board of 
trustees had concluded its 
investigation. At press time, 
Verma had not responded to 
Nature’s request for comment. 


PS FUNDING 
Private money 


US research institutions 
received more than 

US$2.3 billion for basic 
science from philanthropic 
organizations and 
corporations in 2017, 
according to a survey by the 
Science Philanthropy Alliance 
in Palo Alto, California. Now 
in its third year, the survey 


the trees died during the study period. Between 
2005 and 2017, they dated more than 60 trees 
and found that during that period, 9 of the 
oldest 13 trees died, as did 5 of the 6 largest ones. 
The researchers, who published their findings 
on 11 June, found no signs of an epidemic or 
disease; they suggest that changes in climate 
could be to blame (A. Patrut et al. Nature Plants 
http://doi.org/cqsh; 2018). 


looked at 46 institutions and 
showed that private funding 
for basic science remained flat 
from 2016 to 2017. Among the 
25 institutions for which the 
alliance has data from all three 
years, however, funding rose 
to $1.7 billion in 2017, a 13% 
increase from 2016 and a 40% 
increase from the $1.2 billion 
that the survey registered in 
2015. 


Hawking fellows 
The United Kingdom has 
established a postdoctoral 
fellowship scheme in 
honour of the late physicist 
Stephen Hawking. Over 

the next five years, national 
science funder UK Research 
and Innovation will award 
up to ten grants each year for 
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exceptional graduate students 
in mathematics, physics 

and computer science. The 
funding, details of which are 
yet to be released, will allow 
the students to continue the 
work from their doctorates at 
any institution in the country 
for up to three years. Science 
minister Sam Gyimah said that 
the fellowships offer a “fitting 
tribute” to Hawking, who died 
in March and whose work 
changed our understanding of 
the Universe. 


Record flooding 
High-tide flooding in US 
coastal areas is twice as 
frequent as it was 30 years ago, 
according to a report released 
by the National Oceanic and 
Atmospheric Administration 
on 6 June. Tide gauges at 98 
locations around the country 
(excluding Alaska) measured a 
record-high average of 6 flood 
days between May 2017 and 
April 2018. Sabine Pass, Texas, 
recorded the highest number 
of flood days, at 23. Although 
some of the flooding coincided 
with storms, much of it was a 
result of sea-level rise. 


European science 


The European Commission 
has outlined how it plans to 
spend the biggest research 
and innovation budget 

in its history. On 7 June, 


TREND WATCH 


About 70% of people in the 


United States think it is essential 


that their country remains a 


world leader in space exploration, 


a national survey of 2,541 people 
finds. And, in a time when the 
private space sector is growing, 
65% say that NASA should still 
play a key part in space discovery. 
Respondents said that NASA's top 
priority should be monitoring 
Earth’s climate system. But the 
idea of sending astronauts to the 


Moon or Mars — two priorities of 


Donald Trump’s administration 
— didn't prove popular. 


Carlos Moedas, the European 
Union's commissioner for 
research (pictured), presented 
a detailed proposal for the 
structure of the Horizon 
Europe science-funding 
programme, which will run 
from 2021 to 2027 and has a 
provisional budget of nearly 
€100 billion (US$117 billion). 
Horizon Europe will have 
three main pillars: one each 
serving basic research and 
innovation, and another 
directed at solving societal 
challenges and at boosting 
industrial competitiveness. 
The programme will be open 
to all countries worldwide 

for the first time, giving 

the United Kingdom an 
opportunity to take part 

after Brexit. See go.nature. 
com/2llw3jf for more. 


Wild horses spared 
Science and environment 
groups in Australia have 
slammed new laws that 

will protect wild horses in 

New South Wales largest 
national park. State politicians 
passed legislation to protect the 


WHAT SHOULD NASA DO? 


heritage value of the horse — 
an invasive species known as a 
brumby — on 6 June. It rules 
out a cull recommended by 
the state government's own 
environment department to 
reduce the number of horses to 
preserve delicate ecosystems in 
the Kosciuszko National Park. 
The Australian Academy of 
Science says that the laws go 
against scientific advice, while 
the International Union for 
Conservation of Nature warns 
that the bill sets a disturbing 
precedent. Its passing 
prompted ecologist David 
Watson — a member of the 
state government's scientific 
advisory committee on 
threatened species — to resign 
in protest. 


Biotech start-up 


A non-profit institute launched 
by the Bill & Melinda Gates 
Foundation earlier this year 

got its official introduction 
ina7 June announcement 

at the BIO convention in 
Boston, Massachusetts. The 
Bill & Melinda Gates Medical 
Research Institute, which will 
have offices in both Boston and 
Seattle, Washington, will focus 
on diseases of poverty in low- 
and middle-income countries. 
With US$273 million for its 
first four years, the group's 
researchers will probe findings 
from early-stage studies 
conducted at universities and 
biotechnology companies in 


In a survey of 2,541 US adults, most identified climate 


monitoring as a top priority for NASA. 


Hi Top priority Important but lower priority 
BB Not too important or should not be done 


Monitor key parts of 
Earth’s climate system 


Monitor asteroids or objects 
that could hit Earth 


Conduct basic scientific research 
to increase knowledge of space 


Search for life and for planets 
that could support life 


Send astronauts to Mars 


Send astronauts to the Moon 


*Percentages do not add up to 
100% because of rounding. 


31 42 27 


18 45 37 


42 44 


L Percentage of 2,541 respondents*_! 
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search of promising leads on 
cures for tuberculosis, malaria 
and diarrhoea — and aim to 
develop them until they are 
ready for clinical trials. The 
institute will then pass them 
to companies in return for 
commitments to make the 
products available to those in 
need. 


Malaria progress 
Paraguay is malaria-free, the 
World Health Organization 
(WHO) said on 11 June. The 
South American country 
hasn't had a single case of 
malaria in the past six years. 
The WHO reported the news 
in an update on 21 countries 
that are aiming to eliminate 
the disease by 2020. The 
report highlights progress 

in China, El Salvador and 
Algeria, which each had fewer 
than ten cases last year. But 
South Africa, Botswana and 
several other countries hoping 
to eliminate the disease have 
instead seen a rise in cases. 
Resurgences are often caused 
by gaps in health care and 
lapses in mosquito control. 


PS SPACE 
Space missions 


On 6 June, India’s government 
gave its space agency approval 
to help Oman build a space 
programme. A memorandum 
of understanding, signed by 
the two nations in February, 
calls for cooperation in space 
science, planetary exploration, 
satellite-based navigation 

and remote sensing of Earth. 
Oman joins a growing number 
of countries developing 

space programmes. The 
neighbouring United Arab 
Emirates announced last 
week that it has decided on 

a shortlist for its astronaut- 
training initiative. The list 

of 95 men and women will 

be reduced by the end of the 
year to 4 astronauts, who will 
participate in international 
space missions from 2021. 
The country is also planning 
an uncrewed mission to Mars 
in 2020. 


> NATURE.COM 
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Colombia has greater biodiversity than any country except Brazil. 


Peace is killing Colombia’s 


jungle — and opening it up 


Scientists are racing to document once-inaccessible regions as industry moves in. 


BY SARA REARDON 


hen the government of Colombia 
and the left-wing guerrilla group 
Revolutionary Armed Forces of 
Colombia (FARC) signed an agreement in 
2016 to end five decades of conflict, the world 
celebrated. But that hard-won peace has come 
with a hidden cost. 
FARC guerrillas once occupied large 
swathes of Colombia's vast forests, includ- 
ing the Colombian Amazon. The fighters’ 


presence sent smallholder farmers fleeing to 
cities and discouraged development. But as 
FARC has moved out of the forests, industry 
has moved in — including logging, gold- 
mining and cattle-grazing. A government 
analysis found that deforestation increased by 
44% in the year of the peace accords. 

Now, scientists are racing to document 
Colombia's rich biodiversity, which is sec- 
ond only to that of Brazil. In the process, they 
are rediscovering ecosystems that were once 
largely off-limits. Earlier this year, more than 


40 researchers launched a digital platform 
to collate information on weather patterns, 
species distribution and other indicators 
of environmental health across Colombia. 
Their goal is to predict how the encroach- 
ing development could alter the country’s 
forests and other ecosystems — information 
that could shape policies on land use, water 
security and other environmental issues as 
Colombia adjusts to peace. 

If the researchers succeed, their efforts could 
yield insights that benefit other tropical 
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Plant samples collected by researchers at the Humboldt Institute in Bogota. 


nations grappling with climate change and 
> development. “These topics are relevant 
not just for Colombia,” says Alejandro Salazar, 
a soil biologist at Purdue University in West 
Lafayette, Indiana. 

For decades, researchers who wanted to 
study ecosystems in areas controlled by FARC 
and other armed groups had to get permission 
from the fighters. Some scientists persevered. 
Between 1988 and 2002, primatologist Pablo 
Stevenson ran a field station in the Macarena 
region, which is home to several primate spe- 
cies and was then under FARC control. The 
guerrillas would occasionally stop in for coffee, 
but generally left Colombian scientists alone, 
says Stevenson, now at the University of the 
Andes in Bogota. “They were very respectful.” 

Foreign researchers had a harder time — 
possibly because FARC thought they would 
command a high ransom. In 2002, Steven- 
son closed his field station, after guerrillas 
kidnapped a Japanese primatologist and 
demanded that Colombian universities pay to 
get him back. Still, the FARC guerrillas allowed 
the primatologist to keep working during his 
three-month captivity. One US biologist who 
was kidnapped twice by FARC even named a 
toad species Atelopus farci, because its mottled 
green skin resembled the guerrillas uniforms. 

But many scientists avoided such field trips 
altogether. Now, the peace agreement has got 
them excited about the future. “We've been 
exposed to science in conference rooms,” 
says Daniel Ruiz Carrascal, a geoscientist 
at the Antioquia School of Engineering in 
Medellin. “We haven't had the chance to go to 
the field.” He has begun building a network of 


> 
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temperature and humidity sensors in alpine 
regions to monitor climate change. 

The first wave of peacetime studies has 
already yielded surprising discoveries. Last 
year, biologists at the Alexander von Humboldt 
Biological Resources Research Institute in 
Bogota found six new species, including frogs 
and beetles, after just ten days of searching in 
one forested area near Medellin. 

To paint a broader picture of the environ- 
ment, Ruiz Carrascal and others launched the 
Platform for Ecological Analyses on Colombian 
Ecosystems (PEACE). Their goal is to connect 
data from satellites, monitoring systems and 
the small number of scientists who conducted 


fieldwork during the 

FARC conflict. “This is a part 
The PEACE oftheconflict 

consortium’s main nobody saw 

project isa‘datacube: coming.” 


an environmental 
model that simulates future change using infor- 
mation on atmospheric conditions, forest cover, 
fires and animal populations, among other 
factors. Researchers want to use the model to 
answer questions such as how ecosystems adapt 
to unusual conditions in years when the El Nifio 
weather pattern occurs. The researchers also 
hope to reveal how increased deforestation and 
human migration caused by Colombia's peace 
process might affect the environment. Such data 
could inform policies to protect ecologically 
important areas and limit exploitation of natural 
resources. 

Other questions about the future of con- 
servation efforts centre on Colombia's dismal 
record of enforcing its environmental laws. 
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Eighty-four per cent of the deforestation 
there so far has taken place in protected areas, 
says Juan Posada, an ecologist at El Rosario 
University in Bogota. The continued presence 
of illegal armed groups further complicates 
matters. “There are rules in place for biodiver- 
sity, but no one is respecting them, he says. 

And any conservation efforts also have to 
deal with the social issues that have arisen as 
Colombia tries to recover from decades of 
conflict. Since the early 1960s, when FARC 
emerged as a national force, more than 7 mil- 
lion people have fled their homes, largely in 
rural areas. Landowners who try to return 
to their abandoned plots are facing an unex- 
pected problem. Jungles that were once cleared 
for farmland grew back during FARC occupa- 
tion; the government has claimed much of this 
land as a natural resource that cannot be used 
for farming or timber. This has left hundreds of 
thousands of people with no source of income 
if they choose to move back to their land. 

“This is a part of the conflict nobody saw 
coming,” says Carlos Zuluaga, director- 
general of the Antioquia branch of Colombia's 
environmental protection agency, CORNARE. 

In 2013, CORNARE launched a pro- 
gramme that pays about 3,000 families up to 
600,000 Colombian pesos (US$200) a month 
to preserve their land, rather than developing 
it. Zuluaga says that some of these areas could 
be worth more to Colombia if they are kept 
ecologically intact, because of their role in the 
water and carbon cycles. 

Paying people to protect ecosystems has 
been tried in other countries, such as Brazil 
and Uganda. It can work well, at least in the 
short term, says Jennifer Alix-Garcia, an 
economist at Oregon State University in 
Corvallis. “The long-term sustainability is 
kind of unknown because most haven't been 
in place long enough,” she says. The key, she 
says, is for governments to choose the areas to 
protect carefully, and to enforce their agree- 
ments with landowners. Otherwise, people 
could take the government’s money even as 
they give developers access to their land. 

Even if enforcement remains weak, the 
PEACE researchers hope that their project — 
and any monitoring sites they set up — will 
at least help policymakers to understand the 
scope of the problem. “We are in a way redis- 
covering our country,’ Ruiz Carrascal says. m 


Travel for this story was paid for by the Pulitzer Center 
on Crisis Reporting in Washington DC. 
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China introduces sweeping 
reforms against misconduct 


Policies include creation of journal blacklist and assigning policing to government agency. 


BY DAVID CYRANOSKI 


hina is getting tough on scientific 

misconduct. The country’s most pow- 

erful bodies, the Chinese Communist 
Party and the State Council, introduced a raft 
of reforms on 30 May aimed at improving 
integrity across the research spectrum, from 
funding and job applications to peer-review 
and publications. 

Under the new policy, the Ministry of 
Science and Technology (MOST) will be 
responsible for managing investigations and 
ruling on cases of scientific misconduct, a role 
previously performed by individual institu- 
tions. And for the first time, misconduct cases 
will be logged in a national database that is 
currently being designed by MOST. 

Inclusion in the list could disqualify 
researchers from future funding or research 
positions, and might affect their ability to get 
jobs outside academia. The Chinese Academy 
of Social Sciences will oversee the same process 
for social scientists. 

The policy also states that MOST will 
establish a blacklist of ‘poor quality’ scientific 
journals, including domestic and international 
titles. Scientists who publish in these journals 
will receive a warning, and those papers will 
not be considered in assessments for promo- 
tions, jobs or grants. A couple of such blacklists 
already exist, but rarely are they run formally 
by a government agency. 

“Making it clear that articles published in 
‘bad’ journals wont count towards assessment 
of performance sends a strong signal,” says 
Paul Taylor, who heads a scientific-integrity 
programme at RMIT University in Melbourne. 
The plan to crack down on poor-quality and 
predatory journals is a good idea in practice, he 
says, but the ministry could find it difficult to 
identify problematic journals because some are 
more obvious than others — a challenge that 
curators of other blacklists have experienced. 
“Tt will be interesting to see the criteria that are 
developed to make these assessments,” he says. 

A start date for the reforms is yet to be 
announced, but is expected soon. Research- 
ers in China and abroad say the policy will 
have considerable impact. “These new rules 
will make a major difference over time,’ says 
Xue Lan, a science- and innovation-policy 
researcher at Tsinghua University in Beijing. 

Scientific misconduct is a significant 


The Chinese Academy of Sciences will work with the science ministry to set standards. 


problem in China, which has seen a steady 
stream of plagiarism cases, uses of fraudulent 
data, falsified CVs and fake peer reviews. 


HELD TO ACCOUNT 
Xue says the reforms are more practical than 
previous policies, which were based on general 
principles, such as improving researcher ethics, 
and were therefore hard to implement. “They 
lay out an accountability system in a detailed 
way that has never been seen before,” he says. 
As part of the reforms, the science ministry 
will work with agencies such as the Chinese 
Academy of Sciences to create standards for 
determining misconduct, protocols for moni- 
toring and investigating allegations, and rules 
for deciding on the severity of penalties accord- 
ing to the type of misconduct. The policy states 
that funding and jobs can be revoked. Although 
universities currently have these powers, some 
scientists say they are rarely applied. “The life- 
long accountability system will make everyone 
afraid to commit academic misconduct, says 
Yu Hailiang, a mechanical engineer at Central 
South University in Changsha, who blogs about 
science-integrity issues. The rules will help to 
establish a good academic atmosphere, he says. 
Science-policy researcher Tang Li from 
Fudan University in Shanghai also supports 
the reforms, although she worries that, if 
penalties are too harsh, it might prompt a 
backlash from researchers. She also warns 


that the ministry will need to protect whistle- 
blowers and researchers who are wrongly 
accused — something that is happening more 
now that researchers can publicize accusa- 
tions against each other online. Taylor would 
like to see some assurance that the investiga- 
tive process that led to people being named on 
a misconduct database was fair and rigorous. 

The new rules also state that institutions 
could have their funding revoked if they pro- 
tect researchers who have conducted serious 
misconduct. Nicholas Steneck, a researcher 
in scientific integrity at the University of 
Michigan in Ann Arbor, says the plan to pun- 
ish institutions and journals is unique, and 
the journal policy could be a model for other 
countries to follow. 

The policy also includes a plan to overhaul 
how researchers are evaluated for jobs and 
research grants. The current system places 
significant weight on the number of papers a 
scientist has published. But some researchers 
have noted that this encourages corner-cutting 
and fraud. The new rules call instead for uni- 
versities instead to consider quality as well as 
quantity, and to focus on overall innovation 
and impact as well as publication record. 

Efforts to change the culture of science in 
China will be key to reducing misconduct, says 
Yu. “As time goes on, these rules will go deep 
into the hearts of every researcher, allowing peo- 
ple to consciously resist academic bad habits.” m 
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Virologists testing samples for Ebola in Liberia in 2014. 


Fast Ebola test 
limits outbreak 


Health workers in the Democratic Republic of the Congo can 
diagnose the virus in hours, instead of days. 


BY DECLAN BUTLER 


ealth workers fighting the Ebola 
H epidemic that swept West Africa 

several years ago waited days, evena 
week, for the results of laboratory tests to detect 
the deadly virus. But in an Ebola outbreak 
that began in early April in the Democratic 
Republic of the Congo (DRC), this waiting time 
has shrunk to hours — thanks to a genetic test 
that was developed in response to the 2014-16 
West African epidemic. 

Researchers and health officials credit the 
faster tests with helping to contain the spread 
of Ebola in the DRC, by allowing infected 
people to be isolated and their contacts 
traced promptly. And should sparks from 
this outbreak light new fires in neighbour- 
ing countries, the nimbler test could help to 
avert a repeat of the devastating West African 
epidemic. 

The test involves a small machine called the 
GeneXpert, which is widely used across Africa 
to diagnose tuberculosis. The DRC’s govern- 
ment has made the GeneXpert its primary 
method of testing for Ebola in the current 
outbreak.“Labs have been set up with two to 
three days’ notice in new transmission zones, 
whereas in West Africa it took months and 
months to get facilities up and running,’ says 
Mark Perkins, head of laboratory networks 
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for the World Health Organization (WHO) 
Health Emergencies Programme. “It’s a 
remarkable change.” 

One of the biggest lessons of the West 
African epidemic — in which around 
29,000 people were infected and 11,000 died 
in Sierra Leone, Guinea and Liberia — was the 
need to diagnose cases of Ebola more quickly. 
Better lab tests could have averted 30-70% of 
cases and saved thousands of lives and billions 
of dollars, according to a May 2018 report by 
the Foundation for Innovative New Diagnos- 
tics in Geneva, Switzerland. 


RAPID RESPONSE 


With the GeneXpert test, health workers fighting the current 
outbreak of Ebola in the Democratic Republic of the Congo a 


Most Ebola lab diagnostics detect genetic 
sequences that are specific to the virus in 
blood, serum and other bodily fluids. During 
the West African epidemic, these manual tests 
required highly trained scientists working § 
in sophisticated tight-security labs that were | 
often far from outbreak zones. 

In the DRC, by contrast, the same tests are 
automated and are being performed more rap- 
idly and nearer to transmission zones by Gene- 
Xpert. The machine uses custom cartridges 
for different diseases and was developed for 
resource-poor settings. In response to the 
2014-16 epidemic, the machine’s manufac- 
turer developed a cartridge called Xpert Ebola 
to test for the Zaire strain of Ebola, which is 
behind the current DRC outbreak. The turn- 
around time from taking a sample to receiving 
a diagnosis in this outbreak is usually a matter 
of hours or at most a day, Perkins says. 

The DRC outbreak, which is mainly centred 
in remote regions of Equateur Province in the 
northwest of the country, is still relatively 
small, with 38 lab-confirmed cases, 14 prob- 
able cases and 14 suspected (see ‘Rapid 
response). As of 9 June, 28 of the people 
thought to have the virus had died. 

Health officials are cautiously optimistic that 
the outbreak can be stopped quickly. But the 
WHO is still worried that the virus could spread 
across the DRC and the rest of Central Africa. 
Some cases may have gone undetected, which 
could lead to resurgences of the disease. And for 
the first time in the DRC, cases have occurred 
in an urban area. 

Should further regional outbreaks occur, 
GeneXpert machines will continue to be a big 
help, officials say. There are already around 
150 of the machines in the DRC and several 
hundred in nearby countries for testing for 
tuberculosis and other diseases. By swapping 
in Xpert Ebola cartridges, a large Ebola-testing 
network could be quickly created, says Perkins. 

Thanks to lessons learnt from the West 
African outbreak, says Mara Jana Broadhurst, 
an Ebola-diagnostics specialist at Stanford 
University in California, “a new paradigm for 
Ebola virus detection and diagnosis is taking 
shape”. m 
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Price of sucking CO, 
from air plunges 


Technology moves closer to economic viability. 


BY JEFF TOLLEFSON 


phere could be more than an expensive 

last-ditch strategy for averting climate 
catastrophe. A detailed economic analysis 
published last week suggests that the geoengi- 
neering technology is inching closer to com- 
mercial viability. 

The study was conducted by researchers at 
Carbon Engineering in Calgary, Canada, which 
has been operating a pilot CO,-extraction plant 
in British Columbia since 2015. That plant — 
based on a concept called direct air capture — 
provided the basis for the economic analysis, 
which includes cost estimates from commer- 
cial vendors of all of the major components (D. 
W. Keith et al. Joule http://doi.org/cqqj; 2018). 


S iphoning carbon dioxide from the atmos- 


Depending on a variety of design options and 
economic assumptions, the cost of pulling 
1 tonne of CO, from the atmosphere ranges 
from US$94 to $232. By contrast, the previ- 
ous comprehensive analysis of the technology, 
conducted by the American Physical Society 
in 2011, estimated that it would cost $600 per 
tonne (see go.nature.com/2xuauq/). 

Carbon Engineering, which was founded 
in 2009, says that it published the paper to 
advance discussions about the approach’s cost 
and potential. “We’re really trying to com- 
mercialize direct air capture in a serious way,’ 
says David Keith, the company’s acting chief 
scientist and a climate physicist at Harvard 
University in Cambridge, Massachusetts. 

“It's great to see human ingenuity marshal- 
ling around a problem that at first pass seemed 
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to be intractable,” says Stephen Pacala, co- 
director of the carbon-mitigation initiative at 
Princeton University in New Jersey. He gives 
the Carbon Engineering team credit for pub- 
lishing its results and subjecting its proprietary 
technology to public scrutiny. 

The company’s design blows air through 
towers that contain a solution of potassium 
hydroxide, which reacts with CO, to form 
potassium carbonate. The result, after further 
processing, is a calcium carbonate pellet that 
can be heated to release the CO,. That CO, 
could then be pressurized, put into a pipeline 
and disposed of underground, but the com- 
pany is planning instead to use it to make 
synthetic, low-carbon fuels. Keith says that 
Carbon Engineering can produce these for a 
cost of about $1 per litre. When the company 
configured the air-capture plant for this pur- 
pose, it was able to bring costs down to as low 
as $94 per tonne of CO,. 

Klaus Lackner, a pioneer in the field who 
heads Arizona State University’s Center 
for Negative Emissions in Tempe, says that 
Carbon Engineering has taken a “brute- 
force” approach to driving down costs using 
known technologies. “They are coming within 
striking distance of making this interesting 
economically,’ he says. = 


MICROBIOLOGY 


Faecal transplants could 
help preserve vulnerable species 


New gut bacteria can expand the diets of animals such as koalas and rhinoceroses. 


BY SARA REARDON 


oalas are among the world’s fussiest 
kK eaters, consuming only the leaves 
of eucalyptus trees — and just a few 
varieties of eucalyptus at that. Research now 
suggests that the animals’ discriminating diet 
is determined in part by the bacteria that live 
in their guts, which seem to restrict an indi- 
vidual koala’s ability to digest certain species 
of eucalyptus. 

The finding, which was presented on 8 June 
at the annual meeting of the American Society 
for Microbiology (ASM) in Atlanta, Georgia, 
comes amid a growing interest in how an ani- 
mal’s microbiome influences its ability to adapt 
to environmental change. Scientists studying 
koalas and other vulnerable species are try- 
ing to find out whether altering an animal’s 
gut bacteria through its diet — or even faecal 
transplants — can increase survival. 

That is an urgent question for the koala 
(Phascolarctos cinereus), whose habitat in 


Australia is under threat from human activity. 
In some places, the koala population dwarfs 
the supply of eucalyptus — but even when the 
animals are transplanted to areas with abun- 
dant food, some die. Experiments by koala 
ecologist Ben Moore and his colleagues at 
Western Sydney University in Australia sug- 
gest that this might be due to an incompatibil- 
ity between available eucalyptus varieties and 
the mix of an individual koalas gut bacteria. 
Moore and his colleagues collected faeces 
from 200 koalas at 20 sites around Australia. 
When the researchers analysed the plant 
materials in the faeces, they found that some 
koalas ate only a highly nutritious eucalyptus 
species known as manna gum (Eucalyptus 
viminalis). Others ate less-nutritious messmate 
(E. obliqua), and only a fraction of the animals 
would eat both — even at the same site. 
When Moore and his colleagues analysed 
the microbial make-up of the faeces, they 
found that the koalas that preferred manna- 
gum eucalyptus harboured different bacteria 


from those that ate messmate. In an attempt to 
test whether the different diets were the cause 
or the result of the different microbiomes, the 
researchers transplanted faeces from six wild 
koalas that ate messmate into six wild koalas 
that preferred manna gum. Within 18 days, 
the microbiomes of the koalas that underwent 
the procedure were nearly identical to those of 
the donor animals. A few of the animals that 
received transplants also seemed more willing 
to eat messmate. 

To Moore, this suggests that koala-to-koala 
faecal transplants might help to expand the 
types of food available to individual animals, 
and increase their chances of survival. Eria 
Rebollar, a microbial ecologist at the National 
Autonomous University of Mexico in Mexico 
City, says that the koala study is one of the 
first demonstrations that fecal transplants can 
modify wild animals’ microbiomes. 

Other experiments suggest that some 
animals could benefit from having their 
microbiomes reshaped by faecal transplants > 
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Koalas can be very picky about what kind of eucalyptus they eat. 


> from another species. A team led by Denise 
Dearing, a molecular biologist at the University 
of Utah in Salt Lake City, found last year that 
desert woodrats (Neotoma lepida) — distant 
relatives of laboratory rats — carry gut bacteria 
that allow them to eat plants containing 
oxalate, a chemical that causes kidney stones. 
When the scientists, who work with Moore's 
group, transplanted woodrat faeces into lab 


rats, the lab rats gained the ability to degrade 
oxalate. 

In some cases, helping endangered species 
survive might require changing their diets to 
accommodate their existing microbiomes. 
At the ASM meeting, scientists from the 
San Diego Zoo in California presented find- 
ings that suggest how the gut microbiome of 
the near-threatened southern white rhinoceros 


(Ceratotherium simum simum) could interfere 
with its fertility. Captive-born southern white 
rhinos do not reproduce well. 

The team from the San Diego Zoo com- 
pared the faeces of captive white rhinos with 
those of one-horned rhinos (Rhinoceros 
unicornis), which reproduce well in captivity. 
The white-rhino faeces contained chemicals 
known as phytoestrogens, which are present 
in some plants and affect female reproductive 
hormones. Because both species of rhino ate 
the same diet, the researchers suspected that 
their gut microbes might break phytoestrogens 
down differently. 

To test this, the zoo workers switched the 
female white rhinos’ diet to grass pellets, 
which are low in phytoestrogens. Within two 
years, two females that had never successfully 
reproduced became pregnant; they later gave 
birth to healthy calves. Candace Williams, a 
molecular biologist at the San Diego Zoo, says 
that the facility is now feeding grass pellets to 
all its rhinos. She and her colleagues are trying 
to identify which bacteria might be responsible 
for the shift. 

Dearing predicts that science will soon 
reveal many more instances of animals’ micro- 
biomes affecting their ability to survive. “I 
think it’s more common than we've been able 
to document,” she says. “We just didn’t have the 
tools to do this until recently.” m 
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ILLUSTRATION BY KAROL BANACH 


TROUBLE IN THE 


PERIODIC TABLE 


Scientists are changing the rules for approving new elements 
in the wake of concerns over four recent discoveries. 


Sweden should have been upbeat when 

chemists and physicists gathered there for a symposium in May 
2016. The meeting, sponsored by the Nobel Foundation, offered 
researchers a chance to take stock of global efforts to probe the limits of 
nuclear science, and to celebrate four new elements that they had added 
to the periodic table a few months earlier. The names of the elements 
were due to be announced within days, a huge honour for the research- 
ers and countries responsible for the discoveries. 

Although many at the meeting were thrilled with how their field was 
developing — and the headlines it was generating — a significant number 
were worried. They feared that there were flaws in the process of assessing 
claims about new elements, and were concerned that reviews of the recent 
discoveries had fallen short. Some felt there was not enough evidence to 
justify enshrining the most controversial elements, numbers 115 and 117. 
The scientific integrity of the periodic table was at stake. 

Towards the end of the meeting, one scientist asked for a show of 
hands on whether or not they should announce the elements’ names as 
planned. The question exposed the depth of concern among the crowd. 
Most researchers voted to delay the announcement, says Walter Loveland, 
a nuclear chemist at Oregon State University in Corvallis. And that trig- 
gered a remarkable reaction from some of the Russian scientists who had 
led efforts that resulted in three of the elements. “They just stomped their 


T he mood at Backaskog Castle in southern 


BY EDWIN CARTLIDGE 


feet and walked out;’ says Loveland. “I've never 
seen that in a scientific meeting” 

Despite the concerns, the elements’ names were announced soon 
after. Nihonium (atomic number 113), moscovium (115), tennessine 
(117) and oganesson (118) joined the 114 previously discovered ele- 
ments as permanent additions to the periodic table. Nearly 150 years 
after Dmitri Mendeleev dreamed of this organizational structure, the 
seventh row of the table was officially complete. 

Yet the way in which events played out deeply upset some research- 
ers. Claes Fahlander, a nuclear physicist at Lund University in Sweden, 
expects that experimental results will eventually support the claims for 
moscovium and tennessine. Nevertheless, he maintains it was “prema- 
ture” to approve the elements. “We are scientists,’ he says. “We don't 
believe — we want to see proof.” 

As the world prepares to celebrate the International Year of the 
Periodic Table in 2019, debate over the four additions has forced reforms 
to the process for verifying other new elements in the future. And the 
controversy has cast a cloud of uncertainty over the bottom row of ele- 
ments: it is possible that the table’s governing bodies might reassess 
some of the latest discoveries. 

Part of the controversy stems from a rift between some chemists and 
physicists over who should be the legitimate custodians of the peri- 
odic table. Chemists have historically occupied that role, because they 
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discovered the naturally occurring elements through chemical techniques 
over centuries of work. 

For the past several decades, however, nuclear physicists have led the 
hunt for new elements — creating them artificially by smashing atomic 
nuclei into targets. It can take years to produce just one atom of these 
superheavy elements, which are also notoriously unstable, splintering 
through radioactive decay in sometimes fractions of a second. So, as 
groups have vied to be first to create the next elements, it has become 
more difficult to establish proof of their discoveries. 


SIBLING RIVALRY 

Responsibility for approving or rejecting new elements lies with two sister 
organizations: the International Union of Pure and Applied Chemis- 
try (IUPAC) and the International Union of Pure and Applied Physics 
(IUPAP). Since 1999, they have relied on the judgement of a panel of 
experts known as the joint working party (JWP), chaired by Paul Karol, a 
nuclear chemist and emeritus professor at Carnegie Mellon University in 
Pittsburgh, Pennsylvania. Re-established periodically to assess claims for 
discoveries as they arise, the latest version of the JWP assembled in 2012 
and disbanded in 2016. It consisted of Karol and four physicists. 

During that time, the group awarded credit for the discovery of 
elements 115, 117 and 118 toa Russian-US collaboration led by veteran 
nuclear physicist Yuri Oganessian of the 
Joint Institute for Nuclear Research (JINR) 7 
in Dubna, Russia. And the panel assigned W i A A E 
element 113 to researchers at the RIKEN 
Nishina Center for Accelerator-Based S [ | [ N ] | S ] S ' 
Science near Tokyo. W f ) 0) \ : T 

The JWP’s decisions were announced pub- 
licly on 30 December 2015, when IUPAC B ELIEVE 
issued a press release trumpeting the discov- 
eries of the four new elements (which had _ 
not yet received their formal names). Union 
officials said they had worked quickly to W W \ \ | | 0) 
broadcast the decisions. In fact, they made the s 
announcement before the mare executive § E E P RO 0 F. 
committee could approve the JWP’s conclu- 
sions, as is specified by the union’s published rules’; that approval came 
the following month. More controversially, the WP’s findings hadn't even 
been shown to the physics union, IUPAP, which had expected to see them, 
says Bruce McKellar of the University of Melbourne in Australia, who was 
IUPAP’s president at the time. 

That omission inflamed pre-existing tensions between the two unions. 
Cecilia Jarlskog, a physicist at Lund University and IUPAP president 
before McKellar, claims that, for years, the chemistry union has unfairly 
dominated the process of assessing discoveries. (Karol told Nature that in 
preparing the JWP’s reports, he liaised almost exclusively with the chem- 
istry union.) Venting her frustration at the 2016 Swedish meeting, she 
accused IUPAC of trying to steal the limelight by announcing the dis- 
covery on its own, and argued that only physicists “have the competence” 
to assess claims, according to the published version of her presentation’. 

On this occasion, tensions in the physics and chemistry communities 
were heightened by criticism over the JWP’s assessment of claims for 
elements 115 and 117. The JWP backed? the conclusions of the team 
that discovered these elements, which found that chains of radioactive 
decay from elements 115 and 117 matched up in a way that bolsters the 
evidence for both discoveries. But this kind of ‘cross-bombardment’ 
analysis is notoriously tricky for odd-numbered elements. Fahlander 
and his co-workers at the University of Lund report’ that the match-up 
is highly unlikely to exist for 115 and 117 — aconcern brought to the 
attention of the JWP in February 2015. 

Panel member Robert Barber, a nuclear physicist at the University of 
Manitoba in Winnipeg, Canada, says that although he and his colleagues 
“were very concerned” about cross-bombardment, they concluded there 
was no alternative to this type of evidence, and they reached consensus 
on all their decisions. Loveland also supports the overall decision. And 
even if the latest JWP got some details wrong, he says, history shows that 
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its decisions are unlikely to be reversed. 

However, Dubna nuclear physicist Vladimir Utyonkov takes aim at 
the JWP. Although he disagrees with the Lund group’s argument about 
cross-bombardment and is confident that the Russian-US claim is 
robust, Utyonkov maintains that the panel lacked “high-level” experts 
in heavy-element synthesis, and says that its draft reports contained 
numerous errors. Karol defends the work that he and his colleagues 
did as part of the JWP, saying that they tried to abide by the published 
criteria governing the assessment process. Overall, he says, “I believe 
the committee was extremely comfortable with its report”. 

But it seems that most delegates at the 2016 meeting in Sweden were 
critical of the JWP. David Hinde, a nuclear physicist at the Australian 
National University in Canberra, asked the 50 or so researchers present 
whether they thought the panel's findings were “scientifically satisfac- 
tory”. He says that he got very few positive replies to that question. 


REVIEW QUESTIONS 

Despite the various concerns, 'UPAC and IUPAP went ahead in June 2016 
and announced the names of the four new elements. McKellar admits 
that he had doubts about doing so, but says that most of the physicists and 
chemists he consulted told him that the JWP’s overall conclusions — ifnot 
all of the details of their analyses — were probably sound. 

Jan Reedijk, then president of [UPAC’s inorganic-chemistry division, 
says that the initial announcement was made early to avoid press leaks 
and to satisfy demands from the claimant labs, which were eager to get 
the news out. To enable that, he says, he quickly approved the JWP’s 
findings in December 2015 on behalf of his division, after it had been 
peer-reviewed and accepted for publication in the union’s journal Pure 
and Applied Chemistry. “I noted that the proper refereeing had been 
done, so gave my ‘yes’ in less than an hour,’ he says. 

However, it is unclear whether a truly independent review took place. 
According to the chemistry union’s executive director Lynn Soby, the 
JWP’s work was reviewed in a two-step process before the announce- 
ment. First, its findings went to several labs, principally ones involved 
in the latest discoveries as well as another reviewer suggested by one of 
the labs. Then the JWP’s reports were sent to members of the chemistry 
union’s committee on terminology, nomenclature and symbols. 

Soby says that the committee’s job was to check wording and format- 
ting errors, and that therefore it was down to the labs themselves to pro- 
vide scientific scrutiny. She says that was appropriate, given that they are 
the experts in that field. Yet one of those researchers, Utyonkov, thought 
that the chemistry union had recruited 15 independent experts to do 
the scientific review. He assumed that he and two Dubna colleagues had 
been asked to check only facts and figures in the reports. “I don’t know 
how we can be considered as independent referees,” he says. 

Looking back, Jarlskog wishes that she and the rest of the physics 
community had paid closer attention to how the entire assessment pro- 
cess was completed, particularly the refereeing of the JWP’s conclusions. 
“Tam going to have nightmares about how negligent we have been.” 

To address the concerns raised, the two unions have agreed on new 
procedures for assessing any future elements. According to the amended 
rules, which were released in May (see go.nature.com/2jil gv4), the presi- 
dents of IUPAC and IUPAP will now each get the chance to review the 
JWPs findings before announcing their conclusions together. To do so, 
they will carry out an independent peer-review process alongside that of 
Pure and Applied Chemistry. 

McKellar says that the changes will have a positive effect. “Each union 
has developed a good bit of trust working together on this,’ he says. 

But those changes wont satisfy some critics, such as Jarlskog. “I just 
don’t think that the new rules will change anything,” she says. m 


Edwin Cartlidge is a reporter in Rome. 
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Wandering albatrosses follow a vessel as it fishes for toothfish. 


Watch over Antarctic waters 


Ina rapidly changing climate, fisheries in the Southern Ocean must be managed 
cautiously in response to data, warn Cassandra Brooks and colleagues. 


ntartica is a “natural reserve, devoted 
Ae peace and science’, according to 

the Antarctic Treaty System. This 
complex set of agreements collectively takes 
a firm stance on conservation, exemplified 
by the Convention on the Conservation of 
Marine Living Resources. Adopted in 1980, 
this convention was negotiated rapidly in 
response to expanding trawling of Antarc- 
tic krill (Euphausia superba). Krill are at the 
base of the region’s marine food web, so there 
were worries that a dearth of the small crus- 
taceans would threaten the whole ecosystem, 
especially whales. 

The aim of the convention is to conserve 
all biota and ecosystems in the Southern 
Ocean. Although fishing is allowed, it is not 
aright and does not trump responsibility for 
conservation. The convention's provisions 


are strict, precautionary and science-based. 
Nations that are signatories must avoid 
significant or irreversible damage to fish and 
other animals that depend on them. 

But the convention is failing to protect the 
Southern Ocean from overfishing and the 
impacts of climate change. 

Up to 20 nations fish in these icy waters’. 
Antarctic krill and Patagonian and Antarctic 
toothfish (Dissostichus eleginoides and 
Dissostichus mawsoni) are the main quarry. 
More vessels and more-efficient fishing 
technologies are now able to catch more ani- 
mals (see ‘Antarctic fisheries’). Vessels using 
vacuum pumps can suck up 800 tonnes of 
krill in one day’. The vessels compete with 
birds and mammals for food, especially in 
the most accessible waters. 

At the same time, ocean temperatures, 


currents and weather patterns are changing’. 
The northwest coast of the Antarctic Pen- 
insula is one of the fastest-warming places 
on Earth — summer mean temperatures 
are on average 3 °C higher than they were in 
1950. Diminishing sea ice also means fewer 
algae, krill and Antarctic silverfish (Pleura- 
gramma antarctica). Cumulative impacts of 
historical and current fishing combined with 
environmental change have been linked to 
declines in populations of Chinstrap* and 
Gentoo penguins’ (Pygoscelis antarctica and 
Pygoscelis papua). 

Because of the convention's strict 
provisions, its 25-member implementing 
body — the Commission for the Conserva- 
tion of Antarctic Marine Living Resources 
(CCAMLR) — is widely seen as a leader 
in high-seas fisheries management. 
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> But some fishing states are now trying 
to weaken the convention's rules. China, 
CCAMLR’s newest member (as of 2007), 
argues that the convention enshrines 
nations’ rights to fish, rather than a respon- 
sibility to conserve’. China has also insisted 
that no-fishing zones are contrary to the 
convention, even though they are expressly 
included. And it has proposed that scientific 
evidence of a threat is required before an 
area is closed to fishing. Some other fishing 
nations, such as Russia, support this view’. 

Because CCAMLR operates by consen- 
sus, any state can block a measure that it 
perceives is not in its interests. For instance, 
in 2011, South Korea prevented the black- 
listing of one of its vessels that was caught 
fishing illegally*. 

If fishing carries on at its current pace 
amid rapid climate change, prime Antarctic 
fisheries and marine ecosystems could col- 
lapse, as has happened elsewhere’. For exam- 
ple, in the 1990s, after politicians failed to 
act on scientists’ warnings, the abundance 
of Atlantic cod (Gadus morhua) declined to 
less than 1% of historical levels’. 

We urge CCAMLR to better protect 
fisheries in the Southern Ocean. The impacts 
of climate change on populations now and in 
the future should be factored into decision- 
making, to avoid crashes in populations. 
CCAMLR may need to reduce or stop fish- 
ing in threatened areas or where there is high 
uncertainty about adverse effects. Marine 
reserves must be well designed, and more of 
them must be implemented. 

CCAMLR should also do more to sup- 
port basic research that is independent of 
the fishing enterprise. Such studies will lead 
to greater understanding of the dynamics of 
targeted species and their vulnerabilities to 
environmental change and overfishing. 


FISHING PRESSURE 

Antarctic waters have long been plundered 
(see also page 184). Species driven almost to 
extinction include elephant seals (Mirounga 
leonina), blue whales (Balaenoptera muscu- 
lus), king penguins (Aptenodytes patagonica) 
and marbled rockcod (Notothenia rossii)"”. 
Some have bounced back; others haver’t, 
such as the rockcod. Even so, remoteness 
and harsh conditions have protected animals 
in the seas around Antarctica, in comparison 
with those elsewhere. 

In the Southern Ocean, more krill are 
caught than any other species (by weight). 
About 300,000 tonnes are caught annually. 
They are mainly destined for omega-fatty- 
acid supplements and fishmeal. Most krill 
are caught off the Antarctic Peninsula. The 
industry says that such catches are small, 
compared with the more than 300 million 
tonnes of krill estimated to reside in circum- 
polar waters”. 

Patagonian and Antarctic toothfish each 
support relatively small fisheries in the 
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An emperor penguin 
and an ice-breaker in 
the Ross Sea. 


Southern Ocean (see ‘Antarctic fisheries’). 
Owing to high prices, this is lucrative. Exploi- 
tation rocketed in the 1990s, when toothfish, 
rebranded as Chilean sea bass, became popu- 
lar in top restaurants. Illegal, unreported and 
unregulated fishing soared and ravaged popu- 
lations; illegal fishers took six times more fish 
than did legal vessels’. CCAMLR turned this 
situation around by documenting catches, 
monitoring vessels and black-listing those 
that did not comply. Illegal catches fell from 
33,000 tonnes in 1996 to less than 2,000 tonnes 
by 2007 (ref. 11). Nonetheless, many Patagon- 
ian toothfish populations crashed, and remain 
depleted, notably 


those around the “Some scenarios 

Prince Edward idyrequire 

Islands, BANZARE reduced 

BankandKerguelen C@ic hes, or 

Plateau. allocations that 
The life cycles arerestricted, 

of toothfish make temporally or 


them particularly 
vulnerable, as well 
as difficult to study. They mature late, grow 
slowly and can live for 50 years. No one 
knows how many there are in the Ross Sea, 
the main international fishery for tooth- 
fish. Nor does anyone know when, where 
or how often they spawn’. They are the top 
fish predator in the Southern Ocean, and 
they are also key prey of Weddell seals (Lep- 
tonychotes weddellii) and killer whales (Orci- 
nus orca), and compete for smaller fish with 
Adélie penguins (Pygoscelis adeliae). 
Fishing states want more. Russia is trying 


spatially.” 


to increase its toothfish catch and send its 


fleets into unfished areas such as the Weddell 
Sea’. Ukraine wants to capture more krill’. 
New Zealand and Australia, among others, 
have extended their reach into toothfish 
areas’. Other nations, including Namibia 
and Uruguay, signed the convention to gain 
fishing and market access’. 


KNOWLEDGE GAPS 

Climate change compounds the problem. 
But environmental effects are difficult to 
disentangle from the consequences of fish- 
ing. For example, scientists do not know 
whether toothfish fishing or encroaching ice 
is behind the changing prevalence of killer 
whales in the southern Ross Sea”. 

More data would help. But research in the 
Southern Ocean is difficult and expensive. 
Much of what is known about toothfish is 
gathered by the fishing industry, which does 
not collect environmental data. There are 
many gaps. We have much to learn about 
the life histories and population dynamics 
of the species being caught and how environ- 
mental changes affect their birth and death 
rates. There are few quantitative studies of 
connections among targeted organisms in 
the food web. 

The long-term ecological research (LTER) 
programme, based at Palmer Station on the 
west of the Antarctic Peninsula, is unparal- 
lelled in its multi-faceted approach. It gathers 
information, for example, on how fluctuat- 
ing sea ice influences krill and other small 
organisms. CCAMLR also has an ecosystem 
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monitoring programme designed to study 
krill-fishing impacts on land-breeding marine 
predators. However, these data are not effec- 
tively incorporated into decision-making. 
CCAMLR acknowledges that it must 
account for the impacts of global warming 
in its policies. In 2017, it produced a Climate 
Change Response Work Programme to spec- 
ify research and monitoring requirements 
and potential actions”. It says it will engage 
climate-change experts. But progress is slow, 
and climate change is not. CCAMLR’s rules 
and catch allocations are still based on models 
that do not consider climate-change scenarios. 


ADAPTIVE ANALOGUES 

Responsive, ecosystem-based fisheries man- 
agement is still being developed. Lessons are 
emerging from around the world. CCAMLR 
already has the policy tools to benefit. 

Adaptive management is in place off the 
US West Coast. Since 2015, on the basis of 
stock surveys and climate indicators, the 
Pacific Fisheries Management Council has 
temporarily banned fishing of sardines 
(Sardinops sagax). The closure of this large 
fishery (109,000 tonnes in 2012) is bringing 
hardship now. But higher levels of fishing 
may be possible in future when indicators 
allow. 

A moratorium on fishing in the polar 
waters of the Arctic is relevant to the Southern 
Ocean. The 2.8-million-square-kilometre area 
was not fished before 2017 because it was fro- 
zen for much of the year. Now, as around the 
Antarctic Peninsula, reductions in summer ice 


ANTARCTIC FISHERIES 


Krill and toothfish received the largest catch allowances in the Southern Ocean in 
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2017-18. They are increasingly exploited in spite of tight management by the 


Commission for the Conservation of Antarctic Marine Living Resources (CCAMLR). 


are making the area more accessible. In 2017, 
states bordering the Central Arctic Ocean 
adopted the Arctic agreement, pledging not 
to fish there for 16 years, to allow scientists to 
study of the impacts of climate change first. 

In other words, countries can come 
together to protect sensitive fisheries and 
the environment, and to support research. 
The moratorium came about because 
bordering nations were concerned about 
over-harvesting. For example, Bering Sea 
pollock (Gadus chalcogrammus) has yet to 
recover from stock depletion in the 1980s 
and 1990s. This prompted 2,000 scientists 
to petition for a fishing ban in 2012. Three 
years later, the ban was implemented by the 
United States, Norway, Denmark, Canada 
and Russia. China, Japan, Korea, Iceland 
and the European Union have joined since. 

Such multi-national cooperation bodes 
well for the development of strategies for 
future Arctic fishing that are precautionary 
and ecosystem-based. Most of these coun- 
tries are also CCAMLR members. Their 
willingness to show restraint in the face of 
uncertainty in the north could be paralleled 
in the south. 


THREE SOLUTIONS 
Given the threat posed by climate change, 
what are the conditions under which 


@ 38-63 


fishing can continue and still meet the 
precautionary provisions of the conven- 
tion? To avoid passing tipping points in the 
marine food web, CCAMLR needs to take 
the following three steps. 

Implement more and better-designed 
marine reserves. CCAMLR has established 
two, one in the Ross Sea and one in the 
South Orkney Islands Southern Shelf, off 
the eastern tip of the Antarctic Peninsula. 

Neither includes comparable reference 
areas for monitoring fishing against environ- 
mental impacts. The Ross Sea protections are 
set to expire 35 years after they began, which is 
less than the lifespans of many of the animals 
intended for protection, such as toothfish. 
Future closures must ban fishing in the most 
ecologically crucial areas. The protections 
should last at least as long as the life expec- 
tancies of the animals being safeguarded. And 
they should include comparable reference 
areas outside the no-fishing zone. 

Incorporate climate-change scenarios 
into decision rules. Current manage- 
ment measures, including catch quotas, 
are based on models that do not include 
climate-change scenarios. An environmen- 
tal shift could cause a population crash in 
the harvested species or some other spe- 
cies in its food web. To protect against these 
crashes — and to comply with the provisions 


14 JUNE 2018 | VOL 558 | NATURE | 179 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


of the convention — CCAMLR must be 
more precautionary and adaptive. This 
may mean that quotas are reduced, or that 
allocations are more temporally and spa- 
tially explicit. Ifthe threat of overfishing 
is deemed readily apparent, or if the level 
of uncertainty is too high, then CCAMLR 
may need to temporarily close regions of 
the Southern Ocean to fishing. 

Develop more-robust research and 
monitoring programmes. The Scien- 
tific Committee on Antarctic Research 
(SCAR) should first compile the avail- 
able information and ongoing research 
regarding the effects of climate change 
and fish populations in Southern Ocean 
ecosystems. The committee undertook 
these analyses for krill, before establish- 
ing the CCAMLR convention. SCAR 
should then work with CCAMLR sci- 
entists, independent experts and non- 
governmental organizations to identify 
crucial questions, and what is required 
to answer them. CCAMLR needs to be 
more transparent and to invite SCAR and 
other independent experts into its scien- 
tific working groups, from which they are 
currently excluded. 

Governments that are part of 
CCAMLR will need to fund the research 
and monitoring efforts, which must be 
independent of the fishing industry. The 
Palmer LTER programme shows that the 
techniques are available, but investment 
is needed to expand the scientific reach. 

CCAMLR states have acted quickly 
in the past, but change is accelerating in 
the Southern Ocean. Countries must rise 
swiftly to this challenge. m SEE INSIGHTP.199 
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Pandemics: spen 
on surveillance, 
not prediction 


Trust is undermined when scientists make overblown 
promises about disease prevention, warn Edward C. 
Holmes, Andrew Rambaut and Kristian G. Andersen. 


r he resurgence of Ebola virus in the 
Democratic Republic of the Congo 
this May is a stark reminder that 

no amount of DNA sequencing can tell us 

when or where the next virus outbreak will 
appear. More genome sequence data were 
obtained for the 2013-16 Ebola epidemic 
than for any other single disease outbreak. 

Still, health workers in Mbandaka, the 

country’s northwestern provincial capi- 

tal, are scrambling to contain a growing 
number of cases. 

Over the past 15 years or so, outbreaks 
caused by viruses such as Ebola, SARS 
and Zika have cost governments billions 
of US dollars. Combined with a percep- 
tion among scientists, health workers and 
citizens that responses to outbreaks have 
been inadequate, this has fuelled what 


HIGH STAKES 


Estimated cost of surveying 1.67 million animal 
viruses is equal to one-quarter of the NIAID’s 
budget for infectious-diseases research. 


Requested amount for 
Global Virome Project 


seems like a compelling idea. Namely, that 
if researchers can identify the next pan- 
demic virus before the first case appears, 
communities could drastically improve 
strategies for control, and even stop a virus 
from taking hold’”. Indeed, since 2009, the 
US Agency for International Development 
has spent US$170 million on evaluating 
the “feasibility of preemptively mitigating 
pandemic threats”. 

Various experts have flagged up prob- 
lems with this approach (including the 
three of us)**. Nonetheless, an ambitious 
biodiversity-based approach to outbreak 
prediction — the Global Virome Project 
— was announced in February this year, 
with its proponents soliciting $1.2 billion 
in funding from around the world (see 
‘High stakes’). They estimate that other 
mammals and birds contain 1.67 mil- 
lion unknown viruses from the families 
of viruses that are most likely to jump to 
humans, and will use the funding to con- 
duct a genomic survey of these unknown 
viruses, with the aim of predicting which 
might infect people’. 

Broad genomic surveys of animal viruses 
will almost certainly advance our under- 
standing of virus diversity and evolution. 
In our view, they will be of little practical 
value when it comes to understanding and 
mitigating the emergence of disease. 

We urge those working on infectious 
disease to focus funds and efforts on a 
much simpler and more cost-effective way 
to mitigate outbreaks — proactive, real- 
time surveillance of human populations. 

The public has increasingly questioned 
the scientific credibility of researchers 
working on outbreaks. In the 2013-16 
Ebola epidemic, for instance, the inter- 
national response was repeatedly criti- 
cized for being too slow. And during the 
2009 HIN1 influenza epidemic, people 
asked whether the severity of the virus 
had been overblown, and if the stockpiling 
of pharmaceuticals was even necessary’. 
Making promises about disease prevention 
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A health official checks for Ebola symptoms by taking the temperature of passengers arriving at Mbandaka Airport in the Democratic Republic of the Congo. 


and control that cannot be kept will only 
further undermine trust. 


FORECASTING FALLACY 

Supporters of outbreak prediction main- 
tain that if biologists genetically character- 
ize all of the viruses circulating in animal 
populations (especially in groups such 
as bats and rodents that have previously 
acted as reservoirs for emerging viruses), 
they can determine which ones are likely to 
emerge next, and ultimately prevent them 
from doing so. With enough data, coupled 
with artificial intelligence and machine 
learning, they argue, the process could be 
similar to predicting the weather’. 

Reams of data are available to train mod- 
els to predict the weather. By contrast, it 
is exceedingly rare for viruses to emerge 
and cause outbreaks. Around 250 human 
viruses have been described, and only a 
small subset of these have caused major 
epidemics this century. 

Advocates of prediction also argue that 
it will be possible to anticipate how likely 
a virus is to emerge in people on the basis 
of its sequence, and by using knowledge 
of how it interacts with cells (obtained, for 
instance, by studying the virus in human 
cell cultures). 

This is misguided. Determining which 


of more than 1.6 million animal viruses 
are capable of replicating in humans and 
transmitting between them would require 
many decades’ worth of laboratory work in 
cell cultures and animals. Even if research- 
ers managed to link each virus genome 
sequence to substantial experimental 
data, all sorts of other factors determine 
whether a virus jumps species and emerges 
in a human popu- 


lation, suchasthe “Ihe challenge is 
distribution and to link genomic, 
density of animal clinical and 
hosts. Influenza epidemiological 
viruses have cir- datawithin days 
culated in horses of an outbreak 


since the 1950s 
and in dogs since 
the early 2000s, for instance’. These viruses 
have not emerged in human populations, 
and perhaps never will — for unknown 
reasons. 

In short, there aren't enough data on 
virus outbreaks for researchers to be able to 
accurately predict the next outbreak strain. 
Nor is there a good enough understand- 
ing of what drives viruses to jump hosts, 
making it difficult to construct predictive 
models. 

Biodiversity-based prediction also 
ignores the fact that viruses are not fixed 


being detected.” 


entities. New variants of RNA viruses 
appear every day. This speedy evolution 
means that surveys would need to be done 
continuously to be informative. The cost 
would dwarf the proposed $1.2-billion 
budget for one-time sequencing. 

Even if it were possible to identify which 
viruses are likely to emerge in humans, 
thousands of candidates could end up 
being identified, each with a low probabil- 
ity of causing an outbreak. What should be 
done in that case? Costs would skyrocket 
if vaccines and therapeutics were proposed 
for even a handful of these. 


SCREEN AND SEQUENCE 

Currently, the most effective and realistic 
way to fight outbreaks is to monitor 
human populations in the countries and 
locations that are most vulnerable to infec- 
tious disease. This can be done by local 
clinicians, health workers in non-govern- 
mental organizations such as Médecins 
Sans Frontiéres (MSF; also known as 
Doctors Without Borders), and global 
institutions such as the World Health 
Organization (WHO). 

We advocate the detailed screening 
of people who are exhibiting symptoms 
that cannot easily be diagnosed. Such 
tests should use the latest sequencing 
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technologies to characterize all the 
pathogens that have infected an individ- 
ual — the human ‘infectome’™. To track pre- 
vious infections, investigators should also 
assess each person's immune response, by 
analysing components of their blood using 
broad-scale serology’. 

Emerging diseases are commonly associ- 
ated with population expansions — when 
people encroach on habitats occupied by 
animals — as well as with environmental 
disturbances and climate change. Defor- 
estation, for instance, can promote human 
interactions with animals that carry new 
threats, and can increase encounters with 
new vector species such as ticks and mos- 
quitoes’’. Animal die-offs, for example 
that of bar-headed geese (Anser indicus) 
at Lake Qinghai in China in 2005 (which 
was caused by the H5N1 influenza virus), 
can also flag problem regions or emerg- 
ing pathogens. Surveillance efforts should 
therefore focus on communities that live 
and work in such environments. 

Identifying which pathogen is causing 
an outbreak is no longer the bottleneck it 
once was. It took researchers two years to 
determine HIV as the cause of AIDS in the 
early 1980s using microscopy and other 
techniques. By contrast, in 2012 it took only 
weeks for investigators using genomic tech- 
nologies to discover the coronavirus that 
caused Middle East respiratory syndrome 
(MERS). 

Rapid identification of viruses can be 
achieved only if such technologies — and 
the people trained to use them — are glob- 
ally available, including in resource-limited 
regions where the risk of outbreaks might 
be higher. Thankfully, relevant capacity- 
building programmes are now beginning 
to be established, such as the Human 
Heredity and Health in Africa (H3Africa) 
Initiative, run by the UK Wellcome Trust 
and the US National Institutes of Health”. 

Once an emerging outbreak virus has 
been identified, it needs to be analysed 
quickly to establish what type it is; which 
molecular mechanisms (such as receptor 
type) enable it to jump between individu- 
als; how it spreads through human popula- 
tions; and how it affects those infected. In 
other words, at least four kinds of analysis 
are needed: genomic, virological, epidemi- 
ological and clinical. And the data must be 
passed to key stakeholders, from research- 
ers and health workers on the ground to 
international agencies such as the WHO 
and the MSF. Data must be kept as free of 
restrictions as possible, within the con- 
straints of protections of patient privacy 
and other ethical issues. 

This will best be achieved through 
an established global network of highly 
trained local researchers, such as the WHO 
Global Outbreak Alert and Response 
Network (GOARN). Real-time tools for 
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People in Mbandaka are taking extra precautionary measures to stop the spread of Ebola virus. 


reconstructing and tracking outbreaks 
at the genomic level, such as portable 
sequencing devices, are improving fast’. 
Information gathered during recent out- 
breaks has quickly had tangible impacts 
on public-health decisions, largely owing 
to data generation and analysis by many 
research teams within days of people being 
infected”. 

For instance, in the 2013-16 Ebola 
epidemic, genome sequencing of the 
virus proved that a person could sexually 
transmit the disease more than a year after 
becoming infected. This prompted the 
WHO to increase 
its recommended 


beraba ssi “Thereis a 
TUEDDEROLIES : or growing move 
persistent infec towards real- 
tion in survivors 

; time, open- 
of the disease. 
access data and 


Ultimately, the 
challenge is to link 
genomic, clini- 
cal and epidemiological data within days 
of an outbreak being detected, includ- 
ing information about how people in an 
affected community are interacting. Such 
an open, collaborative approach to tackling 
the emergence of infectious disease is now 
possible. This is partly thanks to technol- 
ogy, but is mainly due to a shift in percep- 
tion about the importance of this approach. 
At least in genomic epidemiology, there is 
a growing move towards real-time, open- 
access data and analysis, aided by the use 
of preprint servers and wikis such as Viro- 
logical (http://virological.org). This type 
of collaborative effort can complement 
the work of agencies including the WHO 
and the MSF, which focus predominantly 
on providing information, isolating those 
who have been infected, and so on. 


analysis.” 


So far, researchers have sampled little 
of the viral universe. Surveys of animals 
will undoubtedly result in the discovery 
of many thousands of new viruses. These 
data will benefit studies of diversity and 
evolution, and could tell us whether and 
why some pathogens might jump species 
boundaries more frequently than others. 
But, given the rarity of outbreaks and the 
complexity of host-pathogen interactions, 
it is arrogant to imagine that we could use 
such surveys to predict and mitigate the 
emergence of disease. 

New viruses will continue to emerge 
unexpectedly. There is a lot we can and 
must do to be better prepared. = 
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Sperm whales (Physeter macrocephalus) socialize in the Indian Ocean. 


MARINE CONSERVATION 


Sea changes and whale tales 


Sascha Hooker on three books tackling challenges faced by oceans and marine life. 


hey cover more than two-thirds 
Ts Earth’s surface, support much 

of its biodiversity, produce more 
than 50% of planetary oxygen and 
absorb 20-35% of human-created carbon 
dioxide emissions. Oceans were once 
considered boundless resources, yet, by 
1983, pioneering whale conservationists 
Stephen Leatherwood and Randall Reeves 
were writing in The Sierra Club Handbook 
of Whales and Dolphins that “the seas are 
by no means dead, but they are unques- 
tionably less alive than they were when 
humanity discovered them”. 

Ecological connections between species 
form a tangled web. Intentional perturba- 
tions can lead to unintended consequences, 
or even shift an ecosystem from one state 
into another. With the human population 


soaring towards 8 billion — consuming, 
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Spying on Whales: The Past, Present, and 
Future of Earth’s Most Awesome Creatures 
NICK PYENSON 

Viking (2018) 


The Curious Life of Krill: A Conservation 
Story from the Bottom of the World 
STEPHEN NICOL 

Island (2018) 


Orca: How We Came to Know and Love the 
Ocean’s Greatest Predator 

JASON M. COLBY 

Oxford University Press (2018) 


polluting and emitting as it grows — oceans 
face unprecedented challenges, as do their 
denizens. Nearly 3 million whales were killed 
in the twentieth century; the animals must 
now negotiate hazards such as ship strikes 
and noise. And global harvests of ocean fish 
by humans have hit 1 trillion a year. 


Now, three books — by palaeobiologist 
Nick Pyenson, eminent krill scientist 
Stephen Nicol and historian Jason Colby 
— explore oceans from the perspectives 
of whales and krill across palaeontologi- 
cal, decadal and even annual timescales. 
They examine the evolutionary history 
and ecology that led to today’s ecosystems, 
and provide insights into ocean-resource 
management and how changing public 
sentiment influences the politics behind this. 

Pyenson’s Spying on Whales is a palae- 
ontological howdunnit embedded in a 
travelogue devoted to chasing living and 
extinct whales. The author — curator of 
fossil marine mammals at the Smithsonian 
Institution in Washington DC — takes us 
to field sites from Antarctica to Alaska. 
His narrative captures the excitement of 
suction-cup tagging of humpback whales, 
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and of digs in Panama, seeking answers to 
deep questions in cetacean science. Why are 
today’s blue whales the largest animals ever 
to exist? How can past worlds with higher 
sea levels and more-acidic oceans inform us 
about the future of today’s oceans as climate 
change takes hold? 

Pyenson likens palaeontology to astron- 
omy for its capacity to transport us in 
imagination to places never experienced. At 
Cerro Ballena in northern Chile, he uncov- 
ers the densest fossil-whale site yet found, 
with bones dating to between 6 million and 
9 million years ago. Once a tidal flat, the site 
preserved dozens of whale skeletons. Over 
at least four separate episodes, these whales 
and marine mammals unique to South 
America (walrus-faced whales and aquatic 
sloths), were probably killed en masse by 
harmful algal blooms. 

The ethics of marine-mammal science 
can be fraught: research merit, benefits to 
the population and animal suffering must 
be weighed up. Many in cetacean science 
consider it unethical to use specimens from 
commercial whaling operations, arguing 
that it supports their continuation. Pyenson 
is one of few researchers today who collect 
data, as he puts it, at this “uncomfortable 
intersection” Yet his passion for research 
shines through as he describes the discovery 
of a previously unknown sensory organ 
between the unfused lower jawbones 
of rorqual whales, which might help to 
coordinate the enormous biomechanics 
of their gulp when feeding. Clearly also a 
conservationist at heart, he recognizes that 
whaling has had much broader ecological 
consequences than originally imagined. 

Nicol’s The Curious Life of Krill takes 
the reader deep into the Southern Ocean 
to look at the most abundant multicellular 
marine organism by weight. Antarctic krill, 
crustaceans that grow up to 6 centimetres 
long and feed whales, seals and seabirds, 
have a 400-million-tonne global biomass. 
Nicol writes passionately about their biol- 
ogy, exploitation and management. He 
notes how fisheries have been hampered by 
krill biology, because the creatures’ diges- 
tive enzymes cause rapid degradation after 
death. (Remarkably, these enzymes also 
have antibiotic properties that could ulti- 
mately move krill oil from health food to 
mainstream medicine.) 

Nicol also delves into the influence of 
whales and whaling on ocean produc- 
tivity, noting that the twentieth-century 
decimation of the mammals has not 
resulted in the krill surplus envisioned. It 
now seems likely that all whales have a vital 
role as ecosystem engineers. They transfer 
nutrients from the depths, where they feed, 
to surface waters, where their defecation 
spreads fertilizing iron. That sustains phy- 
toplankton and, in turn, krill. This finding 
changes how we view marine ecosystems: 
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Thysanoessa krill swarm off California. Krill underpin many marine ecosystems. 


no longer as a simple producer-consumer 
economy, but as complex, interlocking eco- 
logical webs. These involve consumption, 
but also recycling of nutrients and minerals, 
so that 3D movements of organisms, sedi- 
ment and suspension are also crucial. 

Colby’s Orca describes a different kind of 
catch: the killer whale (Orcinus orca) live- 
capture industry in the Pacific Northwest of 
North America. From 1961 to 1976, 270 of 
the animals were corralled behind nets and 
50 of them were 


taken fordisplayin “Quy human 
aquariums. Colby tendency to 
vividly chronicles shoot first and 
a rapid change in ask questions 
public sentiment, I qter is evident 
from acceptance 5, whaling, krill 
of shooting killer fishing and the 
whales as feared li od 

ive capture of 


pests in the 1960s 
to protest and sab- 
otage of capture 
attempts in the 1970s. Colby, whose father 
helped with some of the captures, weaves 
into the narrative first-hand interviews with 
the saga’s main players. The result is immer- 
sive and dramatic. Live captures were halted 
in North America in 1976, and attention 
then turned to the release of captive whales. 
However, vast costs and lack of success in 
freeing the killer whale Keiko (brought to 
fame in the 1993 film Free Willy) suggest that 
conservation efforts would be better focused 
on the habitats and prey of remaining wild 
populations. 

Our human tendency to shoot first and 
ask questions later is evident in whaling, 
krill fishing and the live capture of killer 
whales. Consumption is only later fol- 
lowed by research programmes to establish 


killer whales.” 


management regulations. Among those 
landmark pieces of legislation are the US 
Marine Mammal Protection Act of 1972 
and the 1980 international Convention 
on the Conservation of Antarctic Marine 
Living Resources, described by Colby 
and Nicol, respectively. Both measures 
are rightly applauded for their ecological, 
precautionary and data-driven approaches. 
However, both are still predicated on 
quotas for removal, or ‘take’. Nicol briefly 
mentions the focus in recent years on 
marine protected areas — initiatives that 
can prohibit fishing and provide refuges 
for marine species. These present an alter- 
native management tool that should be 
quantitatively considered in the evalua- 
tion and design of what needs protection, 
and where. 

Spying on Whales, The Curious Life of Krill 
and Orca acknowledge that our oceans are 
emptier now than they were only 200 years 
ago. Our ‘shifting ecological baselines’ 
lead us to value the best of the worst of our 
impoverished systems today — to marvel 
at only moderate productivity. Whereas 
Pyenson and Nicol help to reset our expec- 
tations and wake us from our societal and 
generational amnesia, Colby demonstrates 
the speed at which societal attitudes can also 
shift the baseline of our expectations. In this 
age of extinction, with ongoing changes in 
ocean chemistry and physics, it is the poten- 
tial for a sea change in public attitude that 
presents hope. m 
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The Large Hadron Collider at CERN has ruled out many elegant physics theories. 


Beauty, proof and the 
crisis in physics 


Anil Ananthaswamy parses Sabine Hossenfelder’s 
analysis of why the field is at an impasse. 


cc hy should the laws of nature 
care about what I find beau- 
tiful?” With that statement, 


theoretical physicist and prolific blogger 
Sabine Hossenfelder sets out to tell a tale 
both professional and personal in her new 
book, Lost in Math. It explores the morass 
in which modern physics finds itself, thanks 
to the proliferation of theories devised using 
aesthetic criteria, rather than guidance from 
experiments. It also charts Hossenfelder’s 
own struggles with this approach. 
Hossenfelder — a research fellow 
specializing in quantum gravity and modi- 
fications to the general theory of relativity at 
the Frankfurt Institute for Advanced Studies 
in Germany — brings a trenchant new voice 
to concerns that have been rumbling in 
physics for at least two decades. In 2006, 
Lee Smolin’s The Trouble with Physics and 
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Peter Woit’s Not Even 
Wrong fired the first 
salvos at the trend of 
valuing mathematical 
elegance over empiri- 
cal evidence. Both 
books took on string 
theory, a ‘theory of 
everything’ in which 
the fundamental con- 


ee bele stituents of nature 
Physics Astray are strings vibrating 
SABINE in many more spatial 
HOSSENFELDER dimensions than the 
Basic (2018) familiar three. Since 


its entry into main- 

stream physics in the mid- 1980s, the theory 

has failed to make predictions that would 
unambiguously verify or falsify it. 

Hossenfelder, too, tackles string theory, 


but her broadsides are more basic. She points 
to the paucity of experimental data, exac- 
erbated as the machines needed to probe 
ever higher energies and smaller distances 
become more costly to build. Given that, she 
is worried that too many theorists are using 
mathematical arguments and subjective 
aesthetics to judge a theory’s validity. 

For example, Hossenfelder questions the 
desire for naturalness — the idea that a theory 
should not be contrived or have parameters 
that have to be fine-tuned to fit observations. 
The standard model of particle physics feels 
like such a contrivance to many physicists, 
despite its spectacular success in predicting 
particles such as the Higgs boson, discovered 
at the Large Hadron Collider (LHC) at CERN, 
Europe's particle-physics laboratory near 
Geneva, Switzerland. In the theory, to prevent 
the mass of the Higgs from ballooning beyond 
reasonable bounds, certain parameters have 
to be set just so, rather than be derived from 
first principles. This smacks of unnaturalness. 

To get rid of this ugliness, physicists devel- 
oped supersymmetry — an elegant theory in 
which every known particle has a hypotheti- 
cal partner particle. Supersymmetry made 
the Higgs mass natural. It also showed how 
three of the four fundamental forces of nature 
would have been one at energies that existed 
shortly after the Big Bang (an aesthetically 
pleasing scenario). It even unexpectedly pro- 
vided a particle, the neutralino, that could 
explain dark matter — matter that is unseen, 
yet thought to exist because of its observed 
gravitational effect on galaxies and galactic 
clusters. Hossenfelder explains that in com- 
bining everything that theoretical physicists 
value (symmetry, naturalness, unification 
and unexpected insights), supersymmetry 
has become “what biologists fittingly call a 
‘superstimulus’ — an artificial yet irresistible 
trigger”. 

But despite decades of theorizing by hun- 
dreds of top-notch physicists, everyone 
agrees that supersymmetry is in trouble. The 
most natural version of it, which requires 
no fine-tuning, has been ruled out by the 
LHC data. Hossenfelder quotes theorist 
Nima Arkani-Hamed as saying that the “best 
people” were aware of this problem well 
before the LHC went online. Hossenfelder 
then chides those very “best people” — no 
names are given — for not calling “bullshit” 
on widespread claims that the LHC would 
discover supersymmetry or dark matter. 

Hossenfelder often wears a journal- 
ist’s hat. Interviews with highly respected 
physicists (such as theorist Garrett Lisi 
and Nobel laureates Steven Weinberg and 
Frank Wilczek) form a significant chunk 
of Lost in Math, as Hossenfelder strives to 
make sense of the field and her own dis- 
satisfaction with it. We are introduced to 
myriad problems that plague physics, such 
as the fine-tuning of the standard model, 
the lack of a theory of quantum gravity, and 
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worries about what quantum mechanics 
is really saying about the nature of reality. 
(Full disclosure: the latter is the topic of my 
forthcoming book Through Two Doors at 
Once, which Hossenfelder has endorsed). 
Hossenfelder also worries about the lack of 
empirical evidence to sift through in check- 
ing the solutions (see N. Wolchover Nature 
555, 440-441; 2018). 

She reproduces many of her discussions 
with physicists at length, so the material can 
get a tad repetitive. She could have used her 
own strong voice to synthesize some of the 

arguments. Still, 


“Despite there are moments 
decades of when Hossen- 
theorizing, felder’s journalistic 
everyone forays stand out. 
agrees that For example, her 


supersymmetry account of meet- 
is in trouble.” ing the intimidat- 

ing Weinberg, who 
“talks like a book, almost print-ready’, is self- 
deprecatingly funny and spot-on (I speak 
from personal experience). 

Lost in Math is self-aware and dosed with 
acerbic wit, and it asks bold questions. Hos- 
senfelder’s Twitter followers and readers of 
her blog, ‘Backreaction, will recognize her 
no-holds-barred style. But not all physi- 
cists will agree with her. Theorizing in the 
absence of empirical data is not new, and 
has paid dividends. For instance, in the early 
1960s, the physicist Murray Gell-Mann used 
symmetry to tidy up the standard model 
and predict the existence of particles he 
called quarks. The mathematics turned out 
to be correct, and he won the 1969 phys- 
ics Nobel prize for the work. As he noted at 
the Nobel banquet: “The beauty of the basic 
laws of natural science, as revealed in the 
study of particles and of the cosmos, is allied 
to the litheness of a merganser diving in a 
pure Swedish lake” 

Hossenfelder acknowledges all this, but 
she also challenges those who seek to break 
the current impasse in physics by insist- 
ing that nature must be forever beautiful. 
Admitting that “complaining about aesthetic 
biases” won't make the daunting problems 
in physics go away, she argues for a few 
ground rules. These include making sure 
that there is a real problem, which emerges 
from existing conflicts in theory and data; 
being clear about one’s assumptions (such 
as the desire for naturalness or simplicity); 
and using empirical evidence to choose the 
right maths for the physics at hand. They are 
her compass points to prevent us from losing 
our way in a mathematical jungle, however 
beautiful. m 


Anil Ananthaswamy’ next book, Through 
Two Doors at Once, fells the story of 
quantum mechanics from the perspective of 
the double-slit experiment. 

e-mail: anil@nasw.org 
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Books in brief 


American Eden 

Victoria Johnson LIVERIGHT (2018) 

In the 1760s, colonial America was ravaged by yellow fever, typhus 
and tuberculosis. David Hosack, born into that world, became a 

titan of medical research in the fledgling nation. He published on 
tetanus and breast cancer, pioneered smallpox vaccination and, as 
Victoria Johnson’s fine science biography reveals, contributed vastly 
to medicinal botany. Hosack’s famed, now lost, Elgin Botanic Garden 
in New York City became a key training centre for scientists and 
surgeons, who peered “into the globe-spanning, dizzying complexity 
of the natural world” through plants. A rich and compelling read. 


The Neuroscience of Emotion 

Ralph Adolphs and David J. Anderson PRINCETON UNIVERSITY PRESS (2018) 
Anger, fear, joy: what are emotions, and what are they for? The 
sparsity of clear or robust answers spurred neuroscientists 

Ralph Adolphs and David Anderson to frame an integrated science 
of emotion. The result is scholarly, lucid and pertinent to both 
neurobiology and psychology. Mining research from the molecular 
level to the cognitive, they examine emotions as biological and 
reflective of evolved adaptations in species as varied as rodents, the 
fruit fly Drosophila melanogaster and Homo sapiens. They usefully 
conclude with open questions for future research. 
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Polio: The Odyssey of Eradication 

Thomas Abraham Hurst (2018) 

Despite the 99% reduction in polio cases since 1988 under the 
Global Polio Eradication Initiative (GPEl), the disease lingers on ina 
handful of countries. Meanwhile, vaccine-derived polioviruses have 
triggered outbreaks elsewhere. Science journalist Thomas Abraham 
travelled from slum to boardroom to research the GPEI’s premise 
and practice, as well as the broader trajectory of the disease and the 
efforts to tackle it. The result is a trenchant, well-argued analysis, 
isolating problems such as the initiative’s strategic focus on single 
vaccinations in regions also riddled with malaria and diarrhoea. 


Origin Story: A Big History of Everything 

David Christian LITTLE, BROWN (2018) 

Historian David Christian is, with Bill Gates, co-founder of the 

Big History Project, an online syllabus stretching from the 
beginnings of the cosmos to human hegemony. Here, Christian 
distils that 13.8-billion-year chronicle by simplifying the mapping. 
Each threshold, such as the Big Bang or the lunar landings, is 
beautifully captured. Heat energy becomes a “drunken traffic cop”; 
gravity the “virtuoso chain-saw sculptor” of the early Universe; 
humans uniquely “cultivate and domesticate” information, like 
farmers. Long-haul science with wit and oomph. 


Weather: An Illustrated History 

Andrew Revkin and Lisa Mechaley STERLING (2018) 

This fascinating chronicle of humanity’s complex relationship with 
weather by environmental journalist Andrew Revkin and science 
educator Lisa Mechaley is told through 100 milestones, each paired 
with a stunning archival image. A potted history of windscreen 
wipers sits next to a 1903 schematic of Mary Anderson’s invention; 
an image of eleventh-century Chinese scientist Shen Kuo faces his 
prescient observation of climate change; and a dramatic illustration 
of a waterspout accompanies Benjamin Franklin’s bizarre account 
of chasing and whipping a whirlwind in 1755. Barbara Kiser 
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Correspondence 


Wider human-rights 
focus for health data 


Those producing codes of 
conduct for life-sciences 
research under the European 
Union's General Data Protection 
Regulation (GDPR) should draw 
on established international 
work to secure the success of 
scientific data sharing and the 
secondary processing of personal 
data (see also Nature 557, 
467-468; 2018). 

Important guidelines include 
the 2017 Recommendation on 
Health Data Governance from 
the Organisation for Economic 
Co-operation and Development, 
and the Framework for 
Responsible Sharing of Genomic 
and Health-Related Data 
developed by the Global Alliance 
for Genomics and Health (of 
which we are all members). 

The GDPR focuses chiefly on 
the right to privacy. In addition, 
the Framework respects the 
right of everyone “to share in 
scientific advancement and 
its benefits” under Article 27 
of the Universal Declaration 
of Human Rights. In our view, 
this wider human-rights focus 
will be invaluable in regulating 
health and genomic research, 
and the related proportionate 
interpretation and application of 
the GDPR. 

Bartha M. Knoppers* McGill 
University, Montreal, Canada. 
bartha.knoppers@mcgill.ca 

*On behalf of 5 co-signatories (see 
go.nature.com/2lvg9d6 for full list). 


Don’t let the living 
dead haunt citations 


The continued citation of 
retracted papers — or ‘zombie’ 
publications — pollutes the 
scientific literature with fatally 
flawed studies. The problem 

is amplified by the common 
practice of accessing papers 
through third-party websites such 
as Google Scholar, ResearchGate 
and Sci- Hub, which generally do 
not link to retraction notices. We 
propose steps publishers could 
take to prevent new research 


from citing retracted studies. 

As wellas displaying retraction 
notices more prominently on 
their websites, journals should 
post alerts across all pages of 
the flawed publication. Also, 
prefacing the paper's title with a 
notification would warn readers 
not to download the citation to 
reference-manager software. 

Publishers can ensure that 
citations of zombie publications 
are caught before new papers go 
to press by running automated 
cross-checks of manuscript 
reference lists against the 
Retraction Watch database 
of retracted papers (http:// 
retractiondatabase.org). 
Universities, too, should ensure 
that institutional databases are 
updated to include retraction 
notices. 

Sandra A. Binning University of 
Montreal, Canada. 

Fredrik Jutfelt, Josefin Sundin 
Norwegian University of Science 
and Technology, Trondheim, 
Norway. 
sandra.ann.binning@umontreal.ca 


Reform Romania’s 
grant-review system 


Imagine a Nobel laureate 
willing to review research-grant 
applications for Romania. She 
or he would first need to learn 
Romanian, to produce a letter of 
permission to participate from 
their university president or 
department chair, and to upload 
a declaration on Romania's 
platform for grant reviewers 
(www.brainmap.ro) confirming 
that they have committed no 
ethical misdemeanours in the 
course of their duties in the 
previous 5 years. These strictures 
can only lead to the further 
marginalization and inbreeding 
ofa research system that is 
hobbled by plagiarism, paltry 
funds and brain drain. 
According to Eurostat, 
Romania is in the lowest tier of 
European Union countries in 
terms of the percentage of gross 
domestic product (GDP) spent 
on research and development. In 
2018, the nation allocated only 


0.18% of GDP to its Ministry 
of Research and Innovation. 
About one-quarter of these 
funds are used in national calls 
for proposals, which are sparse, 
unpredictable and currently 
evaluated by Romanian nationals. 
We urge the Romanian 
government to reinstate the use 
of international evaluators — 
with scientific merit as the sole 
criterion for selection. It should 
also restore the requirement that 
proposals be written in English. 
Mihai Miclaus National R&D 
Institute for Biological Sciences, 
Cluj-Napoca, Romania. 
Octavian Micu Institute of Space 
Science, Magurele, Romania. 
mihai.miclaus@icbcluj.ro 


Research hotspots 
in Cote d’Ivoire 


Cote d'Ivoire in West Africa 

has some promising research 
institutions, despite the 
considerable social and political 
turmoil it experienced between 
1999 and 2011 (see B. Bonfoh 

et al. Nature 474, 569-571; 2011). 

The top-ranking institutions 
for research productivity in 
2012-16 in Cote d'Ivoire were its 
two largest national universities: 
Félix Houphouét-Boigny 
University and Nangui Abrogoua 
University, both in Abidjan (our 
unpublished results). In third 
place was the comparatively 
small Swiss Centre for Scientific 
Research in Céte d'Ivoire 
(CSRS), which benefits from a 
long-standing bilateral research 
partnership between Cote d'Ivoire 
and Switzerland. At the CSRS, 
mutual governance, investment 
and benefits are balanced with 
a diverse portfolio of research, 
education and training within 
four strategic axes. 

In our view, a strong 
diversification of international 
funding sources, coupled with 
an increased share of national 
government funds, stand to 
create more centres of scientific 
excellence in Africa. 

Bassirou Bonfoh CSRS, Abidjan, 
Cote d'Ivoire. 
Jasmina Saric, Jiirg Utzinger 


Swiss Tropical and Public Health 
Institute, Basel, Switzerland. 
j.saric@swisstph.ch 


Pioneering women 
in energy physics 


lappreciate Roger Fouquet’s 
review of my book Energy: A 
Human History, but take issue 
with two of his criticisms (Nature 
557, 162-163; 2018). 

First, I did not reference solar- 
energy pioneer Maria Telkes 
because her work involved heat 
storage, not solar electricity — 
the subject of my discussion. 

Second, credit for the 
discovery of nuclear fission was, 
in my opinion, more complex 
than Fouquet implies and not 
attributable solely to physicist 
Lise Meitner. 

My reading of the history is 
that the German radiochemists 
Otto Hahn and Fritz Strassmann 
discovered fission in Berlin in 
late 1938. Meitner, who was in 
Sweden at the time, came up 
with the physical explanation 
for this reaction with her 
nephew Otto Robert Frisch, 
after corresponding with 
her former colleagues Hahn 
and Strassmann. The journal 
Naturwissenschaften (renamed 
The Science of Nature in 2015) 
received the Hahn-Strassmann 
paper on 22 December 1938; 
Nature received the Meitner- 
Frisch paper on 16 January 1939. 

It would be odd indeed had I 
forgotten Meitner after devoting 
more than 50 pages to her life 
and work in my 1987 history, 
The Making of the Atomic Bomb. 
Richard Rhodes Half Moon Bay, 
California, USA. 
richardrhodes1@comcast.net 
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OBITUARY 


Stanley Falkow 


(1934-2018) 


Microbe hunter who uncovered how bacteria cause disease. 


mechanisms through which bacteria 

cause disease and how to disarm them. 
But he always described himself as being “on 
the side of the microbes”. 

Falkow, who died on 5 May, began his 
career in the late 1950s, when DNA was first 
recognized as the stuff of life, and bacteria 
were thought to be good model systems for 
studying it. By the 1960s, he had worked out 
how to extract and isolate the extrachromo- 
somal packets of bacterial DNA that we now 
call plasmids, and found that these carry 
information from one microbe to another. 
This showed how bacteria can acquire the 
ability to survive on a new food source or 
become resistant to an antibiotic. 

He made the ominous observation that 
even harmless bacteria could serve as a 
source of resistance genes for pathogens. 
Falkow understood the public-health signifi- 
cance of this, predicted the rise of multidrug- 
resistant bacteria and campaigned against 
the overuse of antibiotics. His discoveries of 
the molecular nature of antibiotic resistance 
won him the 2008 Lasker-Koshland Special 
Achievement Award in Medical Science. 

The question that framed Falkow’s career 
was: what is a pathogen? Specifically, what 
makes some microbes disease agents, while 
others are innocuous or even beneficial? He 
hypothesized that bacteria have genes coding 
for specialized ‘virulence factors’ — the trans- 
ferable, molecular equivalents of claws, fangs, 
toxins, fur and camouflage — that allow path- 
ogenic bacteria to attack, hide and survive in 
extreme environments. He spent his life iden- 
tifying the genes and molecules behind these 
abilities. He discovered, for example, that 
some bacteria decorate their surfaces with 
molecules designed to adhere to host cells, 
and even fool the host into allowing bacteria 
to invade. Others have molecular needles that 
disarm white blood cells by injecting proteins 
that stop the cells moving, or prevent them 
from sounding the alarm. 

Falkow was born the son of Jewish immi- 
grants in Albany, New York, on 24 January 
1934 at the peak of the Great Depression. He 
was a terrible student, especially in mathe- 
matics and science, but loved to read. When 
he was 11, he found in the local library the 
book Microbe Hunters by Paul de Kruif. 
Written for lay audiences in 1926, it describes 
the personalities and adventures of some of 
the most important figures in microbiology. 
Falkow devoured it and decided to become a 
microbe hunter. His life was a conversation 


S tanley Falkow discovered the molecular 
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with his heroes from this book. 


Microbe Hunters begins with Antonie van 
Leeuwenhoek, who, in the 1670s, was the 
first to craft glass lenses so powerful that he 
was able to visualize microscopic organisms. 
When Falkow read this, he had to have a 
microscope. He found a Gilbert Company 
Hall of Science microscope on the shelves of 
the local toy shop (he could not afford it, but 
in the end, the shop owner gave it to Falkow). 
Through this, Falkow witnessed the darting 
shadows of bacteria growing in the spoilt 
milk that he kept under his bed. 

Falkow’s love of microscopy continued 
throughout his career, which started with 
a PhD in 1961 from Brown University in 
Providence, Rhode Island. He learnt elec- 
tron microscopy and, in the 1970s, at the 
University of Washington School of Medi- 
cine in Seattle, used it to visualize resistance 
genes that had hopped on to plasmids. In the 
1980s at the Rocky Mountain Laboratories in 
Montana and at Stanford University School 
of Medicine in California, where he chaired 
the department of microbiology and immu- 
nology, he made micrographs of Yersinia and 
Salmonella bacteria invading the intestine. 

He was the first to make movies with a 
videomicroscope to show how Salmonella 
invade gut epithelial cells by causing a literal 
splash of cytoskeleton and membrane at the 
cell surface. He adapted diverse methods 
of painting and tagging molecules inside 
bacteria and host cells to follow their fate 
during infection, and even to find bacterial 
genes that turn on only when inside cells. 


Another of Falkow’s heroes, Robert Koch, 
established a set of guidelines in the 1880s 
(Koch’s postulates) to define when a microbe 
is responsible for an infection. Falkow 
extended this idea in the 1980s, developing 
the ‘molecular Koch’s postulates. These 
describe how one could define virulence at 
the molecular level — for example by inactiv- 
ating specific genes to render a microbe 
harmless, or transferring a gene for adhesion 
or invasion to an innocuous bacterium to 
improve its ability to interact with host cells. 
His lab elucidated the mechanisms of disease 
of many bacteria (and even yeasts), including 
major gut pathogens such as Salmonella and 
pathogenic Escherichia coli, along with bac- 
teria that cause skin, urinary-tract and sexu- 
ally transmitted infections, whooping cough, 
pneumonia, sepsis, and even stomach cancer. 

Falkow came to think of the “pathogenic 
lifestyle” as being not about causing disease, 
but rather about subtly manipulating a host. 
He became best friends with his prey. Over 
time, he spoke more and more about the 
potential beneficial attributes of bacteria that 
live on and in our body, foreseeing the study 
of the human microbiome. 

Despite his fame, Falkow often said that his 
greatest achievements were those of his stu- 
dents. One of his most important gifts was his 
ability to infect these students with the same 
sense of awe that he had when he first peered 
into a microscope. More than 120 scientists, 
including some of today’s most prominent 
microbiologists, were trained in Falkow’s labo- 
ratory. Despite his illness, he was still teaching 
at Stanford last semester, and video-calling 
into classroom discussions and lab meetings. 

For pragmatists interested in tools for con- 
quering the microbial world, Falkow’s vision 
provided the technical framework by which 
to define potential targets for vaccines and 
methods to identify, fingerprint and follow 
the evolution of pathogens through molecular 
epidemiology. For pure biologists, this same 
vision provided the foundation for under- 
standing the natural history of pathogenic 
and mutualistic microbes. For those of us who 
had the good fortune to know him, he left a 
bounty of stories and examples of his generos- 
ity of spirit as a scientist and a mentor. m 


Manuel R. Amieva is an associate professor 
of Pediatric Infectious Diseases and of 
Microbiology & Immunology at Stanford 
University School of Medicine; he was one of 
Falkow’s last postdoctoral fellows. 

e-mail: amieva@stanford.edu 
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QUANTUM PHYSICS 


Entanglement on demand 


Two experiments show that non-classical correlations can be distributed between distant nodes of a quantum network ina 
deterministic way. The work smooths the path for extended quantum networks. SEE LETTERS P.264 & pP.268 


JULIEN LAURAT 


he phenomenon of entanglement, 

by which two physical systems can 

be more strongly correlated than is 
possible in classical physics, is at the heart of 
quantum mechanics. The efficient distribution 
of entanglement between distant nodes of a 
quantum network has become a cornerstone 
of future quantum technologies — from secure 
long-distance communication to powerful 
quantum computing. However, the simulta- 
neous entanglement of more than a few nodes 
remains a tremendous challenge. On pages 264 
and 268, respectively, Kurpiers et al. ‘and 
Humphreys et al.” demonstrate two methods 
for delivering entanglement in a deterministic 
way that could greatly facilitate the construc- 
tion of large-scale quantum networks’. 

Quantum nodes take a variety of forms. 
They c an comprise an ensemble of optical 
emitters, including laser-cooled atoms or ions 
that are embedded in a crystal. Alternatively, 
they can rely on a single emitter such as an 
atom, a defect in diamond or a superconduct- 
ing quantum bit (qubit). Regardless of the 
physical platform, the generation of entangle- 
ment between two nodes needs to be done in 
a way that can be extended to many nodes. 
Probabilistic methods are unsuitable for such 
a task because simultaneously establishing a 
large number of entangled nodes leads to an 
exponential decrease in the overall chance of 
success. 

The challenge of scalability is highlighted, 
for example, by the prospect of long-distance 
quantum communication. Unlike the signals 
in conventional telecommunications, quantum 
information cannot be amplified or regener- 
ated. As a result, long-distance communica- 
tion requires a quantum-repeater architecture, 
in which the distance between the two com- 
municating parties is divided into shorter 
segments, and entanglement is generated 
in each segment. The segments can then be 
connected, enabling long-distance entangle- 
ment to be established for use in applications 
such as quantum teleportation and quantum 
cryptography. However, if the distribution of 
entanglement in each segment is probabilis- 
tic, the chance of success falls exponentially 
as the distance increases. This scaling issue is 
also present in local networks in which many 
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Nitrogen- 
vacancy centre 


Microwave 
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Figure 1 | Entanglement delivered on time. Kurpiers et al.’ and Humphreys et al.’ report two methods 
for distributing entanglement — or non-classical correlations — between distant nodes of a quantum 
network in a deterministic fashion. a, In Kurpiers and colleagues’ experiment, the two nodes consisted 
of superconducting quantum bits (qubits) that were fabricated on separate chips. Entanglement was 
generated by a highly efficient process comprising the emission of a microwave photon from one qubit 
and its capture by the other, through a wire. The authors used devices known as microwave cavities 
(shown as pairs of disks) to prepare and read out the qubit, and to transfer the photon. b, In Humphreys 
and colleagues’ experiment, the two nodes were defects known as nitrogen-vacancy centres in diamond. 
For each node, the spin (magnetic moment) of the nitrogen-vacancy centre was entangled with the 
presence or absence of an optical photon emitted into a fibre. The two fibres were connected by a device 
called a beam splitter, and the detection of a single photon (indicated by the flash) projected the two spins 
into an entangled state. By producing entanglement more quickly than it was lost, the authors turned a 


probabilistic protocol into a deterministic one. 


modules need to be combined and entangled. 

One way to overcome this limitation is to 
generate entanglement in a deterministic 
manner: press a button to get entanglement. 
This is the landmark advance that Kurpiers 
and colleagues demonstrate. In their experi- 
ment, the quantum nodes consisted of two 
superconducting qubits that were fabricated 
on separate chips and then connected by a 
90-centimetre-long wire (Fig. 1a). Each qubit 
was strongly coupled to two devices known 
as microwave cavities. One cavity enabled the 
preparation and read-out of the qubit, whereas 
the other facilitated the transfer of microwave 
photons through the wire. 

Kurpiers and colleagues’ demonstration 


relied on a 20-year-old seminal proposal for 
directly transferring a quantum state from a 
sender to a receiver*. The authors began by 
preparing one qubit in a specific state. They 
then applied microwave pulses to the qubit, 
which caused it to release a photon that was 
captured by the other qubit. Because the pro- 
cesses of photon emission and capture can 
be extremely efficient in superconducting 
chips, this set-up leads to the distribution of 
entanglement every 0.02 milliseconds, with an 
overall fidelity (a parameter that measures how 
close the entanglement is to the ideal state) of 
close to 80%. Improvements to the experi- 
ment could push this to more than 90%, and 
error-correction procedures might improve 
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this number even further. Such on-demand 
generation of entanglement was also reported 
recently* using a different chip implementation 
from that of Kurpiers and colleagues. 

The direct-transfer strategy has also been 
carried out using optical photons between 
distant atoms or ions*. In that case, reaching 
high efficiencies of photon emission and cap- 
ture remained a challenge. But another way to 
overcome the prohibitive scaling of quantum 
networks is to turn a probabilistic method into 
a deterministic one. 

Under a probabilistic protocol, each attempt 
to produce entanglement has a low chance of 
success. However, if enough attempts are real- 
ized in a given length of time, the generation 
of entanglement can be ensured in this time 
frame. Such an approach can therefore provide 
deterministic entanglement at a predeter- 
mined time. But there is a crucial requirement 
for achieving this goal: entanglement must be 
produced more quickly than it is lost, or else 
the generated entanglement could be gone 
before it has been delivered. 

In 2015, a trapped-ion experiment 
succeeded in breaking this threshold’. 
Humphreys and colleagues have now achieved 
the same feat using a solid-state system. In 
their work, the two quantum nodes were 
single defects, known as nitrogen-vacancy 
centres, in diamond (Fig. 1b). The authors 
placed diamonds in cooling devices that were 
separated by a distance of 2 metres. They then 
generated entanglement between the spins 
(magnetic moments) of the nitrogen-vacancy 
centres by adopting a ‘heralded’ technique 
that has been used for other platforms and has 
enabled rudimentary versions of quantum- 
repeater segments*”. 

Humphreys et al. prepared the two nodes 
so that they had an identical spin state. For 
each system, the authors used laser pulses to 
generate entanglement between the spin of 
the nitrogen-vacancy centre and the presence 
or absence of an optical photon emitted into a 
fibre. The fibres from both systems were con- 
nected, and the detection of a single photon 
midway between the nodes projected the two 
spins into an entangled state. This is because 
there was no way of knowing, even in princi- 
ple, from which node the photon was emitted. 
This single-photon scheme was one of two key 
ingredients of the authors’ work: it enabled a 
much higher rate of entanglement production 
than could be achieved in a previous study’® 
using nitrogen-vacancy centres that relied on 
a two-photon process. 

The second key ingredient was a dramatic 
extension in the lifetime of the stored entangle- 
ment to a period of hundreds of milliseconds. 
The authors achieved this by protecting the 
stored state after it was produced. Overall, 
the combination of the two ingredients ena- 
bled entanglement to be generated almost 
ten times faster than it was lost. Thanks to 
this achievement, Humphreys and colleagues 
demonstrated a deterministic delivery of 


entanglement roughly every 100 milliseconds. 

The long-awaited advances of these two 
research groups demonstrate that the pros- 
pect of realizing functional quantum net- 
works — either on the local scale between 
superconducting modules or on a larger scale 
between communication nodes that are con- 
nected by optical fibres — is getting closer to 
reality (see Nature 554, 289-292; 2018). The 
numbers still need to be improved; in particu- 
lar, the rate at which long-distance entangle- 
ment can be delivered remains too low for 
practical applications. However, an increase 
by a factor of about 100 should be achievable 
in the near future. 

To demonstrate large-scale and long-haul 
quantum networks, a combination of tech- 
niques and tools will be necessary. Among 
other methods, complementary approaches 
that rely on ensemble-based quantum memo- 
ries are being developed at a fast pace’’. When 
combined with massive multiplexing in time, 
frequency or space — a process that has been 
necessary for the development of the Inter- 
net — these methods should be able to provide 
entanglement at a high rate. 

Another important goal will be the realiza- 
tion of an efficient quantum converter that 
links microwave photons to optical photons”. 


IMMUNOLOGY 
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Such a device should enable the platforms 
used in these two studies, which have differ- 
ent capabilities, to be connected. Putting these 
blocks together will be a tremendous challenge 
for science and engineering, but it promises 
to lead to versatile networks in which quan- 
tum processors are interconnected through a 
quantum-communication web. = 


Julien Laurat is at the Laboratoire Kastler 
Brossel (Sorbonne Université, CNRS, ENS, 
Collége de France), Sorbonne Université, 
Campus Pierre et Marie Curie, 75005 Paris, 
France. 

e-mail: julien.laurat@sorbonne-universite.fr 
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Tumour tamed by 
transfer of one T cell 


The T cells of the immune system can be engineered to target a tumour, but why 
some people respond better than others to such therapy is unclear. One patient’s 
striking response to treatment now offers some clues. SEE LETTER P.307 


MARCELA V. MAUS 


he use of genetically engineered 

immune cells to target tumours is one 

of the most exciting current develop- 
ments in cancer treatment. In this approach, 
T cells are taken from a patient and modified 
in vitro by inserting an engineered version of 
a gene that encodes a receptor protein. The 
receptor, known as a chimaeric antigen recep- 
tors (CAR), directs the engineered cell, called 
aCART cell, to the patient’s tumour when the 
cellis transferred back into the body. This ther- 
apy can be highly effective for tumours that 
express the protein CD19, such as B-cell acute 
leukaemias'” and large-cell lymphomas”. 
However, some people do not respond to CAR 
T cells, and efforts to optimize this therapy are 
ongoing. On page 307, Fraietta et al.” report 
the fortuitous identification of a gene that 
positively affected one person's response to 
treatment with CAR T cells. 


Therapies involving engineered immune 
cells use viral vectors based on retroviruses or 
lentiviruses to insert a DNA sequence, such as 
one encoding a CAR, into a person's T cells. 
However, given that there is no control over 
where the sequence inserts into the genome, 
it is possible that the engineered gene could 
insert at a location that disrupts another, 
important gene. In the early 2000s, a clinical 
trial’ enrolled people with immunodeficiencies 
arising from the lack ofa functional copy of a 
particular immune gene. The trial used viral 
vectors to insert a wild-type copy of this gene 
into their stem cells. Unfortunately, however, 
several people developed uncontrolled T-cell 
proliferation that evolved into T-cell leukae- 
mia. This event was linked’ to the gene insert- 
ing within the sequence of the LMO2 gene, 
disrupting the normal regulation of LMO2. 

The pattern of genomic integration sites 
for various viral vectors has been found to 
be specific for a given combination of vector 
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Figure 1 | Tumour targeting by CAR T cells. Ifa patient's T cells are engineered to express a version of an 
immune-cell receptor called a CAR, the cells can target tumour cells that express a specific protein, such as 
CD19. However, not everyone responds to this treatment. Fraietta et al.” report that one patient’s response 
to CAR T-cell treatment has revealed a gene that can affect therapy success. The patient had a mutation in 
one of their copies of the TET2 gene. TET2 encodes an enzyme that converts methyl (CH;) groups attached 
to DNA into hydroxymethyl (CH,OH) groups. This type of change is known as an epigenetic modification. 
When a CAR-encoding sequence was introduced into the patient's T cells using a viral vector, in one cell 
the CAR sequence inserted into the patient’s non-mutated copy of TET2 and disrupted the gene, thereby 
generating a cell that lacked any functional copies of TET2. The clonal descendants of this cell eradicated 
the patient’s tumour. The lack of TET2 altered the cell’s profile of epigenetic modifications, which can affect 
gene expression. This TET2 deficiency was associated with an increase in the expression of tumour-killing 
factors such as the enzyme granzyme, as well as entry into a cellular state called the central memory state, 
which stops the cells from entering a dysfunctional mode called exhaustion. 


and cell type®. In a study of people who had 
T cells modified using retroviral vectors, the 
integration events were not implicated as the 
cause of any cancers’. Lentiviral vectors inte- 
grate randomly into the genome but tend to 
preferentially locate at sites of transcriptionally 
active genes'’. Although random integration 
is generally thought to be safe, any disruption 
of the genome nevertheless confers a risk of 
unwanted consequences. 

The effectiveness of treatments involving 
CAR T cells has been linked to the persistence 
and proliferation of the CAR T cells in the per- 
son's body, and this can be affected by factors 
including the disease subtype, the molecular 
design of the CAR used, and even the manu- 
facturing process’. Fraietta et al. report the 
unusual response of a person in a clinical trial 
whose CART cells targeted a CD19-expressing 
tumour called chronic lymphocytic leukae- 
mia. In this case, disruption of the gene into 
which the CAR sequence had been inserted 
had a direct and beneficial effect on the clinical 
outcome. 

The patient began to show a noticeable 
response to treatment two months after 
receiving a second dose of CAR T cells. 
Tumour regression normally occurs 
within a month if treatment is success- 
ful, so the authors investigated the reason 
for the delay in this case. Crucially, they 
analysed the nature of the CAR T cells at peak 
concentrations in the blood during tumour 
regression. Fraietta and colleagues made 
the surprising observation that these CAR 
T cells consisted almost exclusively of a clonal 
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population descended from a single cell. 

This single cell’s progeny divided over time 
until the cellular descendants reached a tipping 
point that eliminated the entire tumour. It is 
remarkable that the minimally effective and 
curative dose of this form of immunotherapy 
can be the introduction of just one cell. This 
raised the question of why introducing the 
CAR sequence to this specific T cell caused 
such an effective antitumour response. 

In this clonal population of T cells, the CAR 
sequence had inserted into a copy of the TET2 
gene, preventing the gene from encoding a 
functional protein. The patient’s other copy 
of TET2 had a mutation, so insertion of the 
CAR sequence generated T cells that lacked 
TET2 protein. TET2 is an enzyme, also called 
methylcytosine dioxygenase, that catalyses 
a hydroxylation reaction that alters methyl 
groups attached to DNA (Fig. 1). Such modi- 
fications of DNA or its associated proteins are 
known as epigenetic modifications, and they 
can affect gene expression in some cases. When 
Fraietta and colleagues compared the patient's 
T cells that lacked the CAR insertion with 
those into which the CAR had been inserted, 
the overall epigenetic profile was similar. How- 
ever, differences in the structure of the DNA- 
protein complex called chromatin were 
observed in genes involved in T-cell function, 
including CD28, ICOS and the gene that encodes 
interferon-y. 

TET2 mutations have previously been 
associated with clonal blood-cell altera- 
tions linked to a risk of disease or blood 
cancers (a phenomenon known as clonal 


haematopoiesis)''. However, the patient’s 
T cells that lacked TET2 did not give rise to 
either aberrant T-cell proliferation or cancer. 
After tumour elimination, the number of CAR 
T cells decreased appropriately, replicating the 
normal pattern for a T-cell population (increas- 
ing in response to its target and declining after 
target elimination). 

The authors used genetic engineering 
to remove TET2 in human T cells in vitro. 
Analysis of these cells revealed a connection 
between the absence of TET2 and the promo- 
tion and maintenance of T cells in a cellular 
state known as a central memory state. This 
state helps to prevent the cells from entering a 
dysfunctional mode called exhaustion, which 
is linked to ineffective tumour targeting by 
T cells. The absence of TET2 was also linked 
to an increase in long-term T-cell memory. 

Fraietta et al. observed that human T cells 
lacking TET2 made fewer immune signalling 
molecules called cytokines than did cells that 
had TET2. Disruption of TET2 was also linked 
to an increase in the level of the enzymes per- 
forin and granzyme, which are components of 
the tumour-killing machinery of T cells. These 
roles of TET2 in T-cell function and memory 
were previously unknown. 

These remarkable findings might suggest 
that targeting TET2 in human T cells through 
drug-mediated inhibition or gene-editing 
techniques could increase the effectiveness 
of CAR T-cell treatment for other patients. If 
so, perhaps the dose of CAR T cells needed 
might be only a few cells, rather than the usual 
50 million to 500 million cells. This would 
shorten the waiting time for CAR T cells and 
lower the substantial manufacturing costs. 
However, given the known associations of 
TET2 mutations with certain disease states, 
this approach might run the risk of generating 
a malignancy. 

Enhancing CAR T-cell function is an area of 
active research, and other options to achieve 
this goal have been proposed. Inserting a CAR 
sequence at the genomic location where the 
natural version of the gene resides enhances 
the activity and persistence of CAR T cells in 
a mouse model”. Other groups have reported 
progress’ in making CAR T cells resistant to 
inhibitory checkpoint-signalling pathways that 
hinder T-cell function. 

Will one of the many possible approaches be 
preferable to the others? Unfortunately, animal 
studies are not always predictive of results in 
humans, so clinical trials are the only way to 
answer this definitively. The good news is that 
it seems likely that many of these approaches 
will enhance efficacy and safety, so there is 
hope that the use of CAR T cells to treat can- 
cer will become even more successful in the 
years to come. 
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Remote wiggling helps 
cold enzymes work 


Mutations introduced far from the active site of an enzyme can cause local 
unfolding that increases enzyme activity. This finding suggests how organisms 
that live in the cold can speed up biochemical reactions. SEE LETTER P.324 


ASHOK A. DENIZ 


he biochemical sciences have tended 

to focus on processes that take place at 

‘physiological temperatures of around 
37 °C. But much of Earth’s surface is covered 
with ocean, ice or snow, and is replete with 
organisms that function at much lower tem- 
peratures. Life in these environments requires 
suitable biological adaptations, for example in 
the enzymes that maintain the chemical envi- 
ronment of the cell. On page 324, Saavedra 
et al.' shed light on a biophysical mechanism 
for such low-temperature adaptation that 
operates at the molecular level. Their results 
show strikingly that protein modifications 
distant from an enzyme’ active site can mod- 
ulate localized unfolding of the enzyme — 
effectively, wiggling of parts of the enzyme’s 
structure — that can control several facets of 
enzyme-reaction mechanisms. 

Physical chemists have long known that 
the rate of chemical reactions depends on 
temperature, and that reaction rates generally 
decrease as temperatures drop. This tempera- 
ture dependence also applies to enzyme-cata- 
lysed reactions, raising the intriguing question 
of how psychrophilic organisms (which live at 
low temperatures) can maintain their reper- 
toire of enzyme-mediated functions. Related 
enzymes in psychrophilic organisms and in 
mesophilic organisms (which live at physi- 
ological temperatures) have similar activities 
— that is, the reactions they catalyse occur at 
similar rates”. For this to occur, the functional 
parameters of the cold-adapted enzymes must 
have been tuned to compensate for the lower 
temperatures. 

A clue to how this tuning could occur came 
from previous observations’ that psychrophilic 
enzymes tend to have more surface glycine 
mutations (in which an amino-acid residue 
on the protein's surface is replaced by a glycine 
residue) far from the active site than do similar 


enzymes in mesophilic organisms. However, 
the mechanistic details of this phenomenon 
were poorly understood. Saavedra et al. used 
an enzyme called adenylate kinase to test the 
mechanism by which such glycine mutations 
act from a distance to alter enzyme function. 
Adenylate kinase catalyses reactions that 
help to maintain a balance of adenosine 
phosphates (molecules that act as the energy 
currency of cells). The authors chose this 
mesophilic enzyme because it has been used 
extensively as a model system for investigat- 
ing enzyme biophysics, biochemistry and fold- 
ing, including by researchers from the same 
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laboratory as Saavedra and co-workers. 

In the present work, the authors tested 
the previously discussed idea’ that tuning of 
entropy — a measure of disorder — is a major 
driving force in the adaptation of enzymes to 
low temperatures. They sought to probe the 
effect of surface glycine mutations, at locations 
far from the active site, that might change the 
‘wiggling’ (an entropic effect) of the protein 
without changing its overall folded structure. 

There are three domains in adenylate kinase: 
the CORE domain, which contains much of the 
active site, and LID and AMPbd, each of which 
contains part of the active site. The authors 
studied glycine mutations in both the LID and 
the AMPbd domains by using a combination 
of biophysical and structural techniques. These 
studies included measuring the stability of the 
enzyme variants and their binding affinity to 
a mimic of the enzyme’s substrate, and a more 
detailed characterization of the protein states 
and structural fluctuations by using nuclear 
magnetic resonance spectroscopy. 

Taken together, Saavedra and colleagues’ 
results demonstrate that adenylate kinase 
exists in at least three different states, and that 
the mutations change the relative occupancy 
(stability) of these states. Compared with the 


mutation 


Enzyme activity T 


Figure 1 | Entropic tuning of enzyme function allows cold adaptation. Saavedra et al.' made mutants 
of the enzyme adenylate kinase, replacing non-glycine amino-acid residues with glycine residues at the 
surface of either the LID domain (red) or the AMPbd domain (blue). Both types of mutation caused local 
unfolding of the enzyme, increasing its disorder (entropy), and altered the enzyme’s functional behaviour, 
despite being distant from the active site. The LID mutations decreased the affinity of adenylate kinase 
for its substrates, whereas the AMPbd mutations increased the enzyme activity. Such entropic tuning of 
function might be an evolutionary mechanism that allows enzymes to cope with low temperatures, which 


usually slow enzymatic reactions. 
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wild-type protein, both the LID and the AMPbd 
mutants decrease the occupancy of the fully 
folded structure, but they increase the stability 
of two different states in which either the LID or 
the AMPbd domain is locally unfolded (Fig. 1). 
The increased stability of these locally unfolded 
states stems from the fact that the footprint ofa 
glycine amino-acid residue is smaller than that 
of other amino-acid residues, which means 
that the protein chains in the glycine mutants 
are more flexible than those in the wild-type 
protein. Remarkably, these two types of local 
unfolding alter different aspects of the enzyme’s 
function: LID unfolding decreases its binding 
affinity, whereas AMPbd unfolding increases 
its activity. 

A particularly interesting aspect of the 
authors’ work is that it helps us to understand 
how surface glycine mutations act at a distance 
to support cold adaptation. This phenomenon 
might seem mysterious at first glance, but it 
is encapsulated in the idea of allosteric regu- 
lation** — a common form of enzyme regu- 
lation in which the binding of a molecular 
partner at a site distant from the active site 


affects enzyme activity. The conventional 
view of allosteric regulation has been that 
binding of the partner causes small structural 
changes that propagate through the protein to 
alter the structure of the active site. However, 
there is now evidence for mechanisms involv- 
ing changes in the dynamics (entropy), rather 
than in the structure, of the unbound state for 
many cases of allosteric regulation®®. The cur- 
rent work provides striking tests and examples 
of such entropic allosteric regulation for differ- 
ent enzyme properties. 

Saavedra and colleagues’ results have sub- 
stantial implications for the evolution of 
enzyme function. However, their mechanistic 
proposal for cold adaptation was tested for just 
one model enzyme, so its relevance for other 
cold-adapted enzymes requires further testing. 
Deeper insight into the biophysical mecha- 
nisms of cold adaptation will also benefit from 
more-detailed views of the structural fluctua- 
tions of enzymes afforded by single-molecule 
experiments”. Nevertheless, the findings 
open up new avenues for exploring allosteric 
control of multiple modes of protein function, 


How humans and rising 
seas affect each other 


The Paris climate agreement aims to limit global warming to no more than 2 °C. 
An analysis suggests that this will greatly reduce the risks of sea-level rise for 
coastal communities, but that it will take time to see the benefit. 


AIMEE SLANGEN 


oastal zones are among the most 

densely populated areas in the world’, 

but they are threatened by rising sea 
levels caused by climate change. Writing 
in Earth’s Future, Brown et al.’ estimate the 
impact of sea-level rise in terms of the land 
area and the number of people exposed, for 
several scenarios in which global warming 
is limited to different temperature increases. 
Their study shows that the amount of expo- 
sure to sea-level rise depends on our ability 
to cap global temperature changes, and that 
the main benefits of this cap will be seen only 
after 2100. 

Brown and colleagues began their investi- 
gation by simulating how global temperature 
will change in response to different scenarios 
of greenhouse-gas emissions, using a simple 
computational model of the Earth system’. 
They then calculated the global mean sea- 
level rise that would occur as a result of the 
projected temperature changes, assuming that 
the main causes of sea-level rise are the expan- 
sion of the volume of ocean water associated 
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with warming, and the melting of land-based 
ice (that is, ice over land in glaciers, Antarctica 
and Greenland). To account for the fact that 
sea-level rise does not occur uniformly across 
Earth, they then scaled their time series of 
global mean sea levels with previously reported 
projections’ of regional patterns of sea-level 
change for 2100. 

In the scenario in which global temperature 
increases are capped at 1.5°C before 2100, they 
find a median sea-level rise of 0.4 metres by 
2100 and of 1 m by 2300. By contrast, in the 
scenario in which temperatures continue to 
increase as they are doing now, sea-level rises 
are 0.8m in 2100 and a staggering 4.5 m in 
2300. These results show that there will be 
some sea-level rise regardless of efforts to miti- 
gate climate change, because sea levels will not 
immediately stop rising when the temperature 
targets are met. However, the effect of capping 
global temperatures early will be increasingly 
felt after 2100 and lead to significantly less 
sea-level rise by 2300. 

In a second step, Brown et al. considered 
the area of land that has an average chance of 
being flooded once every 100 years (that is, a 


both in natural evolution and in rational 
protein engineering" for biotechnology. m 
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1% chance in any given year); such flooding 
events can be severe (Fig. 1). To do this, they 
used a database of the world’s coastline char- 
acteristics’, which includes land-elevation 
data measured by radar. The authors com- 
bined the data with their scenarios of sea-level 
rises, and found that an area of 540,000 square 
kilometres is already at risk of 1-in-100-year 
coastal flooding events. For the scenario in 
which the global temperature rise is mitigated 
to 1.5°C, this area increases to 620,000 km? 
by 2100, and to 702,000 km’? by 2300 (values 
correspond to the 50th percentile of the range 
of predicted values for sea-level rises). In the 
absence of mitigation, they find that the area 
at risk by 2100 (708,000 km’) is not much 
different from that in the mitigation scenario, 
but increases to 1,630,000 km’ by 2300, which 
is three times the area at risk today. 

In the third component of Brown and 
colleagues’ study, the authors consider the 
number of people at risk from coastal flood- 
ing. If global warming is kept to 1.5°C, they 
find that 1.5-2.1% of the global population will 
be exposed to a 1-in-100-year coastal flooding 
by 2100, compared with 4.3-5.4% of the global 
population in the non-mitigation scenario. In 
other words, more than half of the potential 
population exposure can be avoided by 2100 
if global warming is capped. It is important to 
keep in mind, however, that population expo- 
sure does not depend only on the amount of 
sea-level rise — the number of people exposed 
could decrease if people move away from the 
coast, for example. 

Given that sea levels will rise irrespective 
of future global temperature changes, Brown 
et al. stress that at least some action will need to 
be taken to adapt. However, their calculations 
of the land area exposed to sea-level rise do 
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Figure 1 | Flooding in New Orleans, Louisiana, after Hurricane Katrina, 2005. Brown et al? report that flooding events, such as that caused by Hurricane 
Katrina, will affect more land and people in the future as a result of sea-level rises associated with global warming. 


not consider the effects of coastal-protection 
measures, such as the construction of dykes 
or dunes. If such measures were considered 
in the analysis, then the area and number of 
people exposed would change. Moreover, the 
construction of coastal defences will be largely 
driven by economic considerations, which will 
be different for different countries. 

The authors define land at risk from sea- 
level rise as the 1-in-100-year coastal flood- 
plain. But will people be driven away from 
such land by the infrequent floods, or will 
they accept the occasional inundation, mov- 
ing away only temporarily as needed? It might 
be better to consider the area of land that will 
become permanently flooded to make a more- 
direct estimate of the population that will be 
exposed to sea-level rise. 

A limitation of the study is that Greenland, 
Antarctica and all glaciers elsewhere are 
lumped together as the land ice that contri- 
butes to sea-level rise. However, ice will prob- 
ably melt at different rates in each of these 
regions. This will cause the relative contri- 
butions of the different sources to change 
over time. One way to improve the regional 
estimates of sea-level rise would therefore 
be to scale the contributions of the ice sheets 
according to their individual effects. 

Nevertheless, studies such as those of 
Brown and colleagues are essential, because 
they show the complexity of the climate sys- 
tem’s response to change and how this affects 
society. By directly connecting the effects 
of climate change to the consequences for 
humans, the authors clearly show that climate 
mitigation needs to happen now for a better 
future. m 
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A new phase in 
transcription 


Asubunit of the enzymatic complex P-TEFb can induce compartmentalization 
of proteins into liquid-like droplets in cells. This phase separation might help 
P-TEFb to promote gene transcription. SEE LETTER P.318 


JAMES A. GOODRICH & DYLAN J. TAATJES 


enomic DNA is not indiscriminately 
transcribed into RNA. Instead, select 
sequences are transcribed at any given 
time or in any given cell type. How this process 
is regulated has been an active area of research 
for decades, because of its complexity and 
importance in embryonic development and 
disease. On page 318, Lu et al.' describe an 
aspect of transcription regulation that involves 
liquid-liquid phase separation — a process by 
which proteins, nucleic acids and other mol- 
ecules self-organize into liquid-like droplets 
to enable subcellular compartmentalization. 
Transcription is catalysed by the enzyme 


RNA polymerase II (Pol II). The activity 
of Pol II is, in turn, regulated by a complex 
called P-TEFb, which consists of the protein 
cyclin Tl (CCNT1) and the kinase enzyme 
CDK9. P-TEFb binds to the carboxy-terminal 
domain (CTD) of Pol II (ref. 2) — this domain 
consists of 52 repeats of 7 amino acids, 5 of 
which can be phosphorylated. The CCNT1 
subunit regulates the activity of CDK9, which 
can phosphorylate the Pol II CTD many times 
to help control Pol II function’. 

Lu et al. first investigated which region of 
CCNT1 is responsible for regulating CDK9 
activity. They found that mutations in a 
domain of CCNT1 rich in the amino acid 
histidine led to defects in the activation of 
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Figure 1 | Phase separation in transcription regulation. a, The P-TEFb complex consists of two 
proteins, CCNT1 and CDK9. Lu et al.' have identified a histidine-rich domain (HRD) in CCNT1 that 
promotes liquid-liquid phase separation — a process in which P-TEFb and the enzyme RNA polymerase 
II (Pol II) coalesce inside a liquid-like droplet in the cell nucleus, within which they weakly interact. b, 
The authors provide evidence that these weak interactions within phase-separated droplets enhance 
direct, functional interactions between P-TEFb and Pol II that allow CDK9 to phosphorylate the 
carboxy-terminal domain (CTD) of Pol II, promoting the ability of Pol II to transcribe DNA. 


Pol II-mediated transcription in vitro. Notably, 
removal of this histidine-rich domain (HRD) 
decreased phosphorylation of the full-length 
Pol II CTD by P-TEFb, but had no effect on 
phosphorylation of a truncated, nine-repeat 
version of the CTD. The Pol II CTD is known 
to undergo phase separation into liquid- 
like droplets’, but this ability declines as the 
length of the CTD decreases®. The authors’ 
results therefore hinted that the HRD might 
be involved in phase separation, and that this 
process might improve the ability of P-TEFb 
to phosphorylate the full-length Pol II CTD. 

The researchers then fused CCNT1 to a 
DNA-binding domain that directed it to sites 
at which transcription begins. This fusion 
strategy allowed the group to evaluate the func- 
tion of CCNT1 on its own, away from other 
proteins that would typically guide it to DNA. 
The fusion protein activated transcription, but 
activation decreased if HRD was deleted. This 
finding further supports the idea of CCNT1 
being involved in phase separation, because 
other proteins that induce phase separation 
behave similarly if fused to DNA-binding 
domains*. Presumably, Pol II is recruited into 
phase-separated droplets formed by CCNT1, 
thus concentrating Pol II around DNA and 
promoting the activation of transcription. 

Next, Lu et al. carried out a series of 
experiments that demonstrated that the HRD 
of CCNT1 does indeed promote phase sepa- 
ration in vitro and in cells. As expected for a 
domain that serves a key regulatory function, 
the CCNT1 HRD is evolutionarily conserved 
among vertebrates. Moreover, the authors 
found that a similar domain in a kinase called 
DYRK1A also regulated that enzyme's activity 
and promoted phase separation. 

Finally, the group investigated whether the 
HRD influences the location and dynamics of 
CCNT1 in the nuclei of living cells. Using two 
complementary techniques to track CCNT1 
movement, the researchers provided evidence 
that the presence of the HRD facilitates the 
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retention of CCNT1 at actively transcribing 
genes. Consistent with this finding, CCNT1 
formed concentrated clusters called speckles 
in the cell nucleus in an HRD-dependent 
manner. These speckles were dispersed by 
1,6-hexanediol — a hydrophobic compound 
that disrupts phase separation. 

Lu and colleagues’ work suggests that the 
P-TEFb complex can trigger the formation of 
liquid-like droplets around transcriptionally 
active genomic regions. However, P-TEFb does 
not require phase separation to interact with 
the Pol II CTD (ref. 2), and so it will be chal- 
lenging to determine the exact contribution 
of phase separation to P-TEFb’s function in 
transcriptional regulation. The authors show 
that the HRD augments the ability of CCNT1 
to directly interact with the Pol II CTD. This 
observation can at least partially explain the 
defects in transcription caused by HRD muta- 
tions, but changes in phase separation are 
probably also involved. 

The authors propose that compartmentaliza- 
tion through phase separation, which involves 
multiple weak interactions between proteins 
within a droplet, might concentrate P-TEFb 
and Pol II together. This, in turn, probably pro- 
motes functional interactions between P-TEFb 
and Pol II, leading to highly efficient phospho- 
rylation of the CTD (Fig. 1). In this way, phase 
separation could ensure robust regulation of 
gene-expression programs, and enforce high- 
level activation of genes to enable rapid or 
sustained responses to extracellular stimuli. 

Lu and co-workers’ study raises many 
interesting questions. For example, do 
P-TEFb complexes selectively associate only 
with nuclear phase-separated domains that 
contain Pol II? Or can P-TEFb also associate 
with phase-separated droplets that are devoid 
of Pol II? A transcriptional repressor protein 
called HP1a can also promote phase separa- 
tion®’. Thus, phase-separated domains seem 
to demarcate both transcriptionally active and 
transcriptionally inactive genomic regions. It 


remains to be seen whether the HP1la and 
the P-TEFb compartments are biophysically 
distinct, and whether each can selectively 
exclude specific sets of proteins, nucleic acids 
or metabolite molecules to effectively repress 
or promote transcription, respectively. 

The impact of CCNT1 HRD mutations on 
global gene-expression patterns also remains 
to be defined. On the basis of current mod- 
els*, it seems plausible that highly expressed 
genes will be most sensitive to HRD muta- 
tions, because their elevated expression might 
require high local concentrations of Pol II and 
transcription factors in phase-separated drop- 
lets. If this is indeed the case, cells that carry 
HRD mutations might be less able than wild- 
type cells to mount effective transcriptional 
responses to DNA damage or viral infection, or 
might fail to maintain cell-type-specific gene- 
expression programs over time. Such defects 
could contribute to cancer and other diseases. 
To our knowledge, any clinical manifestations 
of HRD mutations are unknown, and await 
discovery. 

P-TEFb phosphorylates other transcrip- 
tional regulatory proteins in addition to 
Pol II (ref. 9). As such, inhibition of P-TEFb 
kinase activity shows promise as a therapeutic 
strategy for combating a variety of diseases, 
including HIV infection and cancer’. Com- 
pounds that inhibit P-TEFb repress transcrip- 
tion across the entire genome’ , and it has been 
assumed that the inhibitors alter the function 
of P-TEFb’s numerous targets for phospho- 
rylation’. Is phase separation also involved? 
Phase-separated domains can reversibly form 
and dissolve, and this behaviour is sensitive 
to phosphorylation’. Notably, Pol II CTD 
phosphorylation seems to either promote or 
prevent phase separation, depending on which 
other proteins are present in the droplet’*”. 
Inhibition of P-TEFb kinase activity might 
therefore alter the formation and dissolution 
of phase-separated droplets. This intriguing 
possibility remains to be rigorously tested. m 
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n uninterrupted thread of discovery, triumph, 
A« mystery—tempered by disaster, exploitation, 
and misery—courses through humanity's 

interaction with Antarctica. The relationship is riven REVIEWS 
with conflict, and heroic interventions are often required: 
reminiscing about unimaginable suffering and bleak 
prospects, geologist and explorer Sir Raymond Priestley 
once said “...get down on your knees and pray for 
Shackleton”. Now, even though Shackleton continues 
to inspire, the situation is largely reversed. Rather than 
humans seeking salvation from Antarctica’s harsh caress, 
itis Antarctica that must seek protection from humanity. > 

This Nature Insight on Antarctica peers into these 
many facets of Antarctica’s past, present and possible 
futures. In the first Review, Ed Brook and Christo 
Buizert synthesize 800,000 years of climate and 
atmospheric history to depict Antarctica’s response 
to and influence on the broader climate system. Steve 
Rintoul then reviews the vast Southern Ocean, and 
shows that localized processes—such as eddy-driven 


200 Antarctic and global climate history 
viewed from ice cores 
Edward J Brook & Christo Buizert 


209 The global influence of localized 
dynamics in the Southern Ocean 
Stephen R Rintoul 


mixing sourced from current interaction with bottom te ae apmemiaicelce 
topography—drive the overall system. Sheet from 1992 to 2017 
An alarming amount (more than 50 metres) of sea- The IMBIE team 


level rise lurks in the Antarctic Ice Sheet, and a first step 
in understanding the ice’s fate is knowing the current REVIEW 
mass imbalance. In an Analysis, the IMBIE team brings 223 Trends and connections across the 
together the lengthening remote sensing record and Antarctic cryosphere 

: , , Andrew Shepherd, Helen Amanda Fricker 
reveals an accelerating trend of ice loss. Then, ina Sinead Laisa are? 
Review, Andy Shepherd, Helen Amanda Fricker and 
Sinead Louise Farrell discuss the tightly interlinked 
processes governing recent trends in Antarctic sea ice, ice 
shelves and grounded ice. 

Finally, in a Perspective arising from a 2014 panel 
discussion attended by winners of the Tinker-Muse Prize 
for Science and Policy in Antarctica, Steve Rintoul and : 
colleagues present two visions of Antarctica in 2070: in = 2 
one, humanity’s appetite for resources and fossil fuels 
continues to increase, and governance weakens; in the 
other, world development proceeds with a sharper focus 
on conservation, mitigation, and strong governance. 

The Insight brings together much of our understanding 
of natural science in and around Antarctica, reveals our 
current impacts on it, and outlines a pathway through 
which the ghost of Shackleton might not need to raised, 
on behalf of an entire continent. 


PERSPECTIVE 

233 Choosing the future of Antarctica 
SR Rintoul, S L Chown, R M DeConto, 
MH England, HA Fricker, 
V Masson-Delmotte, T R Naish, 


Michael White MJ Siegert & J C Xavier 
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Antarctic and global climate history 
viewed from ice cores 


Edward J. Brook! & Christo Buizert! 


A growing network of ice cores reveals the past 800,000 years of Antarctic climate and atmospheric composition. The 
data show tight links among greenhouse gases, aerosols and global climate on many timescales, demonstrate connections 
between Antarctica and distant locations, and reveal the extraordinary differences between the composition of our 
present atmosphere and its natural range of variability as revealed in the ice core record. Further coring in extremely 
challenging locations is now being planned, with the goal of finding older ice and resolving the mechanisms underlying 
the shift of glacial cycles from 40,000-year to 100,000-year cycles about a million years ago, one of the great mysteries 


of climate science. 


he origins of the Antarctic ice sheet are in late Eocene-early 

Oligocene time, about 34 million years ago; the last major phase 

of growth began at about 14 million years Bp (before present)'”. 
Coring in the present ice sheet (Box 1) has recovered records up to 
800,000 years (800 kyr) old and there are hopes of extending the contin- 
uous record further. New studies on the ice sheet margin have provided 
unique but discontinuous samples of older ice’. Owing to the flow of 
ice from the interior to the ice sheet margin, ice deposited during the 
earliest history of the ice sheet is not likely to have been preserved, but the 
maximum age of extant ice is unknown. 

The Antarctic ice sheet preserves a history that links Antarctica to the 
rest of Earth. Richly detailed and uniquely preserving the composition of 
the atmosphere, the record in the ice underpins much of global climate 
change research. Drilling and recovering long ice cores (Box 1) require 
specialized engineering owing to the low temperatures and the high pres- 
sures inside the ice sheet, the need to preserve the ice for analysis, and the 
remote locations of deep-drilling sites. The stratigraphically continuous 
dataset from cores in the dry ice sheet interior that extends to 800 kyr 
is complemented by more detailed, but younger, datasets from coastal 
sites with higher snowfall rates (Box 1). Collectively, Antarctic ice cores 
provide fundamental insights into the nature of global climate cycles 
driven by orbital variations, internal climate variability on sub-orbital 
timescales, changes in global biogeochemical cycles, Antarctic climate 
dynamics, abrupt climate change, and a host of other topics. 

One of the strengths of the ice core record is that a wide variety of 
parameters, reflecting different aspects of the Earth system, can be meas- 
ured in great detail in the same core. The isotopic composition of the ice 
is a proxy for local temperature’, while the chemical composition records 
the input of dust, sea salt, volcanic material, pollutants, other aerosol 
material, and even extraterrestrial dust®*. Past snowfall rate, a funda- 
mental climate parameter, can be derived from layer counting and other 
age constraints®. The temperature of the ice sheet itself retains a memory 
of past climate and can be measured in the ice core borehole!”. Gradual 
compaction of the firn converts snow to ice, trapping small samples of 
the atmosphere in a matrix that is remarkably resistant to gas loss. The 
trapped air provides the only direct record of changes in atmospheric 
composition prior to modern atmospheric measurements. 

Over the last decade, drilling and analysis of Antarctic ice cores have 
uncovered a large amount of new information. This paper reviews the 
myriad ways in which these data show how connected Antarctica is to 


the rest of the world. Deep ice cores from the Antarctic interior show 
that on long timescales Antarctic climate closely follows variations in 
solar insolation that drive climate change globally and that it exhibits 
major temperature changes at the termination of ice ages. New results 
from high-resolution cores in high-accumulation regions provide 
unprecedented detail about the millennial-scale climate ‘seesaw’ between 
Antarctica and the Northern Hemisphere, a signature of variations in 
ocean heat transport related to shifts in Atlantic Ocean circulation. 
The dust content of ice cores reveals enhancements in dust flux in cold 
climates, with possible contributions to ocean productivity. Greenhouse 
gas data from trapped air show how global biogeochemical feedbacks 
contribute to climate change on long and short timescales, and that 
climate and greenhouse forcing are extremely tightly coupled (Box 2). 


The long view: ice age cycles 

Two deep ice cores from low-accumulation regions of the East Antarctic 
Plateau, one at Dome Concordia! (Dome C) and a second at Dome 
Fuji!? (Dome F, or Valkyrie Dome) provide now-iconic records of 
multiple glacial cycles as far back as 800 kyr and 720 kyr Bp, respectively 
(Fig. 1). These results complement the pioneering 420-kyr dataset from 
the Vostok ice core!?. Viewed broadly, almost all climate-related param- 
eters in these records show major, synchronous variations across the 
glacial cycles, which have an average duration of about 100 kyr. The 
Antarctic records closely mirror other global environmental proxies, 
most notably the oxygen isotopic composition of benthic foraminifera 
in deep-ocean sediments (Fig. 1g), which is widely used as an index for 
global ice volume and glacial—interglacial conditions'*. All Antarctic 
records show a characteristic sawtooth pattern on these timescales, with 
a gradual cooling trend from glacial inception to peak glacial condi- 
tions, followed by a relatively fast glacial termination. 

The stable water isotope ratios (6D or &'80) traditionally used as 
temperature proxies suggest that the amplitude of East Antarctic 
temperature change over the glacial cycles ranges from about 6 °C 
to 13 °C, with the largest-amplitude oscillations’! during the last 
450 kyr. Ice isotope records are an indirect measure of temperature, 
however, and are influenced by other phenomena related to moisture 
transport and deposition. The ice sheet has a long thermal memory, 
and in locations with thick ice and high accumulation rate borehole 
thermometry can be used to circumvent these problems and estimate 
glacial—interglacial temperature change. Borehole-based temperature 


1College of Earth, Ocean, and Atmospheric Sciences, Oregon State University, Corvallis, OR, USA. *e-mail: brooke@geo.oregonstate.edu 


200 | NATURE | VOL 558 | 14 JUNE 2018 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


REVIEW 


Box | 
Ice core drilling 
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reconstructions at the WAIS Divide site in west Antarctica'® indicate 
a glacial-interglacial temperature change of 11.3 + 1.8 °C for the last 
termination, consistent with estimates based on stable water isotopes 
alone. Globally, glacial-interglacial temperature change has been 
estimated’ at about 3.5 °C; the higher Antarctic values confirm the 
theory that polar temperatures change more than tropical temperatures 
do (polar amplification)’°. 

It is understood that the glacial cycles are paced by variations in 
Earth's orbit!®, with insolation changing due to orbital eccentricity 
(with an approximately 100-kyr period), axial tilt (with an approxi- 
mately 41-kyr period) and precession of the equinoxes (with an approx- 
imately 19-23-kyr period). In terms of radiative forcing at the top of 
the atmosphere, the insolation changes by themselves are too weak 
to drive global temperature changes, and feedbacks such as changes 
in greenhouse gases and the high albedo of extensive (Northern 
Hemisphere) glaciation are required to explain the observed climate 
variations)”. 

Several challenges remain to our understanding of glacial- 
interglacial dynamics, foremost of which is the lack of a clear expla- 
nation for the apparent 100-kyr periodicity of the glacial cycle, given 
that the 100-kyr eccentricity cycle produces negligible variations in 
either seasonal or annual radiative forcing (the so-called ‘100-kyr 
problem’)!7-!°. Closer inspection shows that individual cycles are 
not uniform in length; accurate U/Th dating of speleothems suggests 
that glacial terminations are actually spaced by four to five precession 
cycles, supporting the theory that changes in Northern Hemisphere 
insolation driven by precession are the predominant driver of glacial 


The science of ice core drilling originated in the 1950s and the first deep core in Antarctica was at Byrd Station in 1968, during the 
International Geophysical Year!!!. Since then, numerous deep-coring projects have been completed (blue circles in a in the figure). The 
longest core, at Vostok Station, reaches 3,700 m below the ice surface. The oldest so far, the EPICA Dome C ice core, extends to 800,000 
years!! 12. The technology for deep ice coring has gradually evolved, with recent developments in larger volume and replicate coring drills!!3, 
and new access tools that will allow quick sampling of deep ice sections, coring within bedrock and analysis in situ!®1-!3, National archive 
facilities retain samples from all deep ice cores, preserving a unique resource for the scientific community. At an ice core drilling camp at the 
South Pole (b), the long arch structure contains the drill. The tipping tower of the US Deep Ice Coring Drill is shown (c). An ice core section 

in the WAIS Divide Drill (d) is shown immediately after a drilling run. A core section from the WAIS Divide site (e) shows a visible volcanic ash 
layer. The core is 12 cm in diameter. Image credits: b, US National Science Foundation (http://spicecore.org/photos.shtml); photographs in 


terminations”’. Several models have been put forward to ‘predict’ which 
Northern Hemisphere insolation maxima lead to glacial terminations 
and which do not!??!. Most proposed solutions to the 100-kyr prob- 
lem rely on the inertia of gradually growing Northern Hemisphere ice 
sheets that have the ability to survive several insolation maxima, typ- 
ically with a nonlinear response of ice volume to insolation!”!®??-*, 
A second challenge concerns the remarkable coherence of Antarctic 
temperature variability and global climate change on orbital timescales, 
although orbital forcing due to precession acts with opposite effect 
in each hemisphere”’. The canonical view holds that global climate 
responds to summer solstice insolation at 65° N, the latitude band of the 
large Northern Hemisphere ice sheets. Indeed, accurate dating of the 
Dome Fuji ice core confirms that Antarctic orbital-scale climate change 
follows Northern Hemisphere insolation closely, with the largest warm- 
ings concurrent with rising Northern Hemisphere summer insolation”. 
The most obvious mechanism to explain this bi-hemispheric coher- 
ence is through the globally well mixed greenhouse gases, although 
their link to Northern Hemisphere insolation remains incompletely 
understood. Rising Northern Hemisphere insolation could further 
drive enhanced Laurentide ice sheet meltwater runoff into the North 
Atlantic to reduce the Atlantic overturning circulation, which would 
in turn warm Antarctica via the so-called ‘bipolar seesaw’ (see section 
“The close view: millennial-scale variability’), suggesting a role for 
ocean circulation in synchronizing the hemispheres>”>”’. Alternatively, 
it has been suggested that synchroneity at precession frequencies arises 
from the fact that Antarctic temperature is sensitive to local summer 
duration (which changes in-phase with the Northern Hemisphere 
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Box 2 
CO, and temperature phasing 
during the last deglaciation 


The question of the relative timing of greenhouse gas and 
(Antarctic) temperature change at the glacial terminations 

has generated substantial interest. Evaluating this phasing is 
complicated by the difference in age between the ice and the 
gas trapped inside it. Early studies suggested a substantial 

(600- to 1,200-year) lag of the COz2 concentration rise behind 
Antarctic warming!!4!!7, More recent work (see figure), based 
on high-accumulation cores and improved treatment of firn 
compaction, finds COz and Antarctic temperature to be more 

or less synchronous (within uncertainty) for the last two glacial 
terminations®+!18119, The close relationship between Antarctic 
temperature and COz obviously reflects important feedbacks and 
interactions between the global carbon cycle and the climate 
system. The early studies suggesting a long lag of CO2 increase 
behind Antarctic warming have been misinterpreted as proof of 
a negligible warming effect of greenhouse gases; this is incorrect 
because (1) the onset of Antarctic warming is driven by changes 
in interhemispheric heat exchange, rather than by COs, and (2) the 
global temperature rise lags COs, rather than leads it!. However, 
what then does this phasing tell us? One interpretation is that 

he close phasing reflects the dominant role of Southern Ocean 
ventilation in setting atmospheric COz levels!!9. Alternatively, 

he rise in both COz levels and Antarctic temperature may be 
caused by reduced North Atlantic Deep Water formation, in which 
case their synchroneity reflects a common driver, rather than 
interdependence. Much as in Douglas Adams’ The Hitchhiker’s 
Guide to the Galaxy, although the answer to our question is now 
apparent, precisely what it signifies remains to be revealed. 

The figure shows atmospheric CO2 change from the WAIS Divide 
(Antarctica) ice core for the period 19-9 kyr ago, global temperature 
reconstruction}, the east Antarctic oxygen isotope stack (the water 
189/160 isotope ratio anomaly relative to the present)!19, and the 
atmospheric CH, record from the WAIS Divide ice core?2, Vertical 
yellow bars show the timing of major inflection points in the COz 
record. Grey shading around the black trace indicates uncertainty 
in the temperature reconstruction. LGM, Last Glacial Maximum; 
HS1, Heinrich Stadial 1; BA, Balling—Allerad; ACR, Antarctic Cold 
Reversal; and YD, Younger Dryas. 
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summer solstice insolation intensity in the precession band), rather 
than summer peak insolation (which changes out-of-phase with the 
same)”®. Although local insolation may play a role at some ice core 
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sites”’, the precise timing and magnitude of temperature variations 
suggest that the bipolar seesaw and greenhouse gas variations are the 
dominant influences on orbital-scale Antarctic climate. 

A third challenge is that the amplitude of the glacial cycles in 
Antarctic temperature, CO, and ice volume increased around 450 kyr 
BP, at the so-called Mid-Brunhes event, through an intensification of 
the interglacial climates (Fig. 1). Atmospheric CH, and other markers 
of tropical hydrology appear to be only weakly affected by this transi- 
tion*®*°. It is debated whether the Mid-Brunhes is truly a single event, 
or whether this apparent transition simply emerges from the orbital 
forcing without changes in the underlying coupling between insolation 
and climate’®*!3!, Ocean sediments show that the current ‘100-kyr 
world’ was preceded by a “41-kyr world’ in which glacial cycles were 
paced by Earth’s axial tilt“. The transition between these two occurred 
between 1,200 kyr and 800 kyr bp, a period not covered in the ice core 
record. A major goal of the international ice coring community is to 
recover a continuous ice core through this key transition (see section 
‘The future of Antarctic ice core science’), primarily to examine changes 
in greenhouse forcing and the temporal patterns of Antarctic climate. 

In addition to the palaeotemperature data, the long ice cores pro- 
vide co-registered records of the flux of dust, sea salt and other atmos- 
pheric aerosols, and the concentrations of long-lived atmospheric 
gases (Fig. 1). Glacial periods are characterized by much higher levels 
of mineral dust deposition (Fig. 1c and d)**73, commonly attributed 
to changes in both source strength and atmospheric transport, with 
the former term dominating**. Climate-driven changes in Southern 
Hemisphere dust source regions (Patagonia and possibly Australia) 
include more exposure of sources due to aridity or glaciation, and 
increases in wind strength*”. Changes in iron delivery to the ocean 
from dust** are probably involved in changes in ocean productivity 
and atmospheric CO, levels*>°. There has been considerable interest 
in developing an (aerosol-based) ice core tracer of past sea ice extent, 
given the important role of sea ice in the climate system. Initially prom- 
ising candidates include sea salt sodium and methanesulphonic acid. 
Further work suggested that quantitative interpretations are difficult 
because of a variety of confounding effects in the transport and pro- 
duction of both tracers*”. 

The three major greenhouse gases (carbon dioxide (CO2), methane 
(CH,) and nitrous oxide (N2O)) also show large variations on the same 
timescale as ice volume and Antarctic climate over the glacial cycles 
(Fig. le and f)*°"8?, The largest variations are systematically associated 
with the glacial terminations, although the imprint of axial tilt and pre- 
cession is also evident. These greenhouse gas changes act as a positive 
feedback on the orbitally paced glacial cycles and account for about 
40% of the glacial-interglacial change in Earth’s radiative balance”. 

Atmospheric CO, concentrations range from roughly 170 to 300 
parts per million (p.p.m.) over the glacial—interglacial cycles (Fig. le). 
In addition to their impact on Earth's radiative budget, CO; variations 
reflect global carbon cycling and climate-biosphere interactions. A sat- 
isfactory accounting for the full magnitude of the glacial atmospheric 
CO), reduction is still lacking*"!, although it is fairly clear that multiple 
processes operated. Changes in carbon storage on land and changes in 
CO, solubility in the ocean are obvious players, but the sum of these 
effects results in little net change”. Therefore, a higher glacial oceanic 
carbon inventory must also be involved and both physical and biolog- 
ical processes in the ocean probably contributed. 

The Southern Ocean is a key region of interest. The dense, deep water 
mass originating around Antarctica that fills the abyssal oceans occu- 
pied more volume during the last glacial period and was more isolated 
and poorly ventilated, allowing it to store large quantities of respired 
carbon”, Increased efficiency of the biological pump, for example, 
through iron fertilization or Southern Ocean stratification*®, could 
further draw down atmospheric CO). Wind-driven upwelling in the 
Southern Ocean brings deep waters rich in respired carbon near the 
surface, allowing carbon exchange with the atmosphere; low nutrient 
utilization in the Southern Ocean's surface makes this CO; ‘leak to the 
atmosphere more effective*!®, Processes that may control the deglacial 
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Fig. 1 | Data covering the last 800,000 years from long Antarctic ice 
core records, and the benthic isotope stack, a proxy for global glacial- 
interglacial cycles, with upward direction corresponding to warm 
interglacial conditions. a, The EPICA Dome C 8D (7H/'H isotopic ratio 
of water)!!. VSMOW, Vienna Standard Mean Ocean Water. b, The Dome 


release of carbon through the Southern Ocean include the extent of 
Antarctic summer sea ice” or the strength and position of the Southern 
Hemisphere westerly winds*’. The atmospheric carbon isotopic com- 
position measured in ice cores is consistent with deglacial ventilation 
of respired carbon from the deep ocean**®. The uniformity of glacial 
minimum CO), levels is an interesting challenge, suggesting a consistent 
negative feedback, the nature of which is not yet clear®. 

CH, rose by about 300 parts per billion (p.p.b.) at glacial termina- 
tions, with smaller changes during glacial cycles linked to insolation 
variations driven by orbital tilt and precession*® (Fig. 1f). On millen- 
nial timescales, CH, is tightly coupled to Dansgaard—Oeschger events 
(see section “The close view: millennial-scale variability’), although 
the response appears to be modulated by insolation®®. A number of 
factors have been invoked to explain CH, variations on these times- 
cales, including changes in emissions from wetlands (the major mod- 
ern source), release of CH, from sea floor hydrates or permafrost, and 
changes in the atmospheric sink (primarily hydroxyl radical). Isotopic 
tracers and modelling do not support a dominant role for the last two 
factors?!~>°. Instead, CH, variations are consistent with changes in 
tropical and mid-latitude hydroclimate, driven by both insolation and 
shifts in the position of the Inter-Tropical Convergence Zone (ITCZ) 
during abrupt climate events. Additional boreal contributions may 
have existed during interglacials when Northern Hemisphere ice sheets 
retreated*®. The ice core record contains no indications of very large 
bursts of CH, during interglacial periods, even during those warmer 
than the Holocene, suggesting that “CH, time bomb’ scenarios of a 
runaway positive carbon cycle feedback for the Arctic are unlikely in 
the near term. A slower, more chronic release of CH, from melting 
permafrost, thermokarst lakes or marine hydrates is expected asa result 
of global warming”. 

N20 also varies on glacial—interglacial timescales, with typical 
glacial and interglacial values of around 210 p.p.b. and 270 p.p.b., 
respectively*’. Although the associated radiative forcing is small, the 
history of this gas is of interest as an integrative tracer of changes in the 
global nitrogen cycle and because of likely positive feedbacks of global 
warming on the modern NO budget. The primary natural sources are 


Fuji 580 ('80/18O isotopic ratio of water)!”. c, d, The EPICA Dome C®! 
and Dome Fuji!” dust records. e, f, The EPICA Dome C/Vostok CO,! and 
CH,” records. g, Benthic oxygen isotope stack'*. PDB, Pee-Dee belemnite 
standard. 


microbial nitrification and denitrification in both marine and terres- 
trial ecosystems. Changes in the ocean source are linked to circulation- 
driven changes in upper ocean oxygen that affect denitrification™. 
Changes in nitrification on land are linked to temperature and rain- 
fall, primarily in the tropics. Stable isotope data indicate that both the 
marine and terrestrial sources increased during the last deglaciation, 
with fast changes” in the terrestrial source that paralleled those in CH. 

Apart from the changes in the main greenhouse gases, other aspects 
of the atmospheric evolution revealed by Antarctic ice cores add to our 
understanding of Earth system change on long timescales; the list below 
is by no means exhaustive. First, the isotopic composition of atmos- 
pheric O2 shows a strong orbital precession signal, with superimposed 
millennial-scale variations linked to abrupt climate change. On both 
timescales this reflects the strength of the global monsoon, which is 
modulated by both insolation and the position of the ITCZ'?. Second, 
atmospheric O, concentrations have been falling at a rate®! of 8.4%o per 
million years, suggesting that at present O> sinks (the oxidation of sed- 
imentary organic carbon and pyrite) exceed O, sources (the burial of 
the same) by about 2%. Third, ice core measurements demonstrate the 
steady accumulation of radiogenic “°Ar (from “°K decay in the crust) 
in the atmosphere, allowing an estimate of the contemporary crustal 
degassing rate and providing a dating technique for old ice. Last, the 
atmospheric Kr/N)2 ratio is a proxy for global mean ocean tempera- 
ture®, owing to the temperature dependence of solubility. The reduced 
atmospheric krypton inventory during the Last Glacial Maximum sug- 
gests that the mean oceanic temperature was 2.57 + 0.24 °C lower than 
at present™. The trend of mean ocean warming during the last deglaci- 
ation follows Antarctic temperature and atmospheric CO, closely, fur- 
ther demonstrating the close link between the high-latitude Southern 
Hemisphere and the global climate system. 


The close view: millennial-scale variability 

Ice cores also preserve the impact of millennial-scale and shorter 
climate variability, increasingly recognized as important for under- 
standing Earth system feedbacks and the potential for abrupt future 
change. The now well known, abrupt Dansgaard—Oeschger events 
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Fig. 2 | Abrupt climate variability of the last Ice Age. a, Records of abrupt 
climate variability. From top to bottom, the traces show Greenland water 
isotope ratios from the NGRIP core®’, atmospheric CH, from the Antarctic 
WAIS Divide ice core®*, Antarctic ice core water isotope ratios from the 
WAIS Divide core’* and atmospheric CO, from a multi-core compilation!!”. 
Water isotope ratios are measured relative to Vs;MOW. Blue bars show the 


observed in Greenlandic ice cores (Fig. 2) and other Northern 
Hemisphere climate records covering the last ice age®-®” have well 
documented counterparts in Antarctica®*-73, termed Antarctic 
Isotope Maxima (AIM) events. The precise relative timing of events 
in Greenland and Antarctica has been established by using well mixed 
atmospheric gases as stratigraphic markers®*- "734, Antarctica warmed 
during Northern Hemisphere cold periods and cooled when Greenland 
was warm (Fig. 2). This bi-polar linkage is well documented for the 
last ice age and deglaciation. It very probably operated in prior ice ages 
but does not have obvious counterparts during interglacials such as 
the present Holocene. There are regional differences in the expression 
of the AIM events, which may ultimately provide more information 
about mechanisms”. 

The concept of the ‘bi-polar seesaw’ emerged from these obser- 
vations of asynchronous temperature variations between the 
hemispheres”””°. The basic theory is that perturbations to the north- 
ward, cross-equatorial heat transport of the Atlantic Ocean exert 
opposite temperature effects on both hemispheres, with the Antarctic 
counterpart damped by a large heat reservoir, commonly assumed to 
be the Southern Ocean. Climate models in which the strength of the 
Atlantic Meridional Overturning Circulation (AMOC) is perturbed, 
usually through freshwater input into the North Atlantic, can reproduce 
the seesaw pattern’ ’®., 

Recently, the WAIS Divide site in West Antarctica provided a climate 
record with temporal resolution similar to that of the central Greenland 
cores, although extending only to 68 kyr (the longest stratigraphically 
ordered Greenland record is NGRIP, at about 123 kyr long). This new 
ice core combines high accumulation rate, very low gas/ice age differ- 
ence, and high-precision atmospheric records. These attributes allow 
for the most precise investigation of the interpolar phasing of the 
bipolar seesaw yet’?. Comparison of Antarctic and Greenlandic events 
reveals on average an approximately 200-year-long lag of the Antarctic 
response behind both Greenland abrupt warming and cooling events 
(Fig. 2). The origin of this delay must lie in the climate coupling 
between the hemispheres, and indicates a north-to-south propagation 
of the climate signal dominated by oceanic processes (given that atmos- 
pheric propagation would be much faster). Further work on the WAIS 
Divide ice core”? explored interhemispheric atmospheric teleconnec- 
tions by analysing the deuterium excess record, a parameter believed to 
reflect source moisture conditions and transport pathways to the site. 
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approximate timing of Heinrich Stadials 5 to 3. Numbers indicate AIM 
events. b, Interpolar phasing of abrupt climate change. The records from a 
are aligned at the abrupt Northern Hemisphere transitions (yellow vertical 
lines), and averaged to obtain the shared climatic signal. The Antarctic 
cooling response of the bipolar seesaw is delayed by around two centuries 
behind the abrupt Northern Hemisphere events”’. 


This study showed that a component of the WAIS Divide deuterium 
excess variations is closely correlated with Greenland climate at zero 
time lag, implying a fast (atmospheric) link between abrupt warming 
in the north and shifts in Southern Hemisphere moisture pathways to 
Antarctica. A similar response had been observed in the Dome C core 
for termination II®°. These studies indicate that the two polar regions 
are coupled via both oceanic and atmospheric teleconnections, each 
operating on their own timescale. 

Tracers of mineral dust deposition in Antarctic ice cores also vary on 
millennial timescales®””*! with lower dust deposition during warmer 
periods. Changes in dust transport and generation of dust in source 
areas are both likely to be involved. Sea salt sodium does not display 
as clear a link to the millennial-scale climate events, but as discussed 
above, interpretation of this proxy is complicated. Deposition of 
both dust and sea salt aerosol appear to be regionally variable within 
Antarctica. 

Atmospheric gas records from Antarctic ice cores are providing more 
detail about millennial-scale variability in global biogeochemical cycles. 
Changes in CO, tend to follow Antarctic AIM events closely (Fig. 2), 
particularly for the larger events, as well as for the deglacial warm- 
ing where the two are essentially synchronous (see Box 2). This close 
association of Antarctic temperature and CO; suggests that Southern 
Ocean processes are also critical to atmospheric CO) variability on 
these timescales. Changes in export production or ventilation in the 
Southern Ocean are candidate mechanisms. Both can be linked to 
AMOC changes, for example, via increased upwelling during Heinrich 
stadials due to southward migration of the Southern Hemisphere 
westerly winds”, or via more direct impacts of AMOC changes on 
ocean circulation® or the efficiency of the biological pump*®. 

High-resolution CO, data reveal that very rapid carbon cycle 
variations are superimposed on the slower millennial changes during 
the last ice age and deglaciation®***. There appear to be two types of 
rapid CO, increase. The first is exemplified by increases during two 
major warming events in the Northern Hemisphere during the degla- 
ciation: the Younger Dryas termination and Bolling-Allerod warming 
(Box 2). In each case, CO? increases of about 10 p.p.m. took place over 
periods of 100-200 years. Their coincidence with northern warming 
is unambiguous, given coincident increases in atmospheric CH, that 
mark the two abrupt northern events. The second type is associated 
with Heinrich stadials, periods of ice-rafted debris discharge into the 
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Fig. 3 | Dependence of millennial-scale variability on the background 
climate state. a, Dansgaard—Oeschger interstadial duration (logarithmic 
scale) from Greenland ice core records plotted against Antarctic 
temperature’? and atmospheric COs, with the coefficient of determination 
R? listed for each case. b, Schematic of Greenland Dansgaard-Oeschger 
variability during various states of the background climate. On the left, 
during cold (low-CO3) periods such as Marine Isotope Stages (MIS) 2 

and 4, Dansgaard-Oeschger interstadials are infrequent and of short 


North Atlantic that occurred during several of the Greenland stadial 
periods. Clear centennial-scale CO, rises are seen during Heinrich 
Stadial 1 (16.3 kyr Bp)*4 (Box 2) and Heinrich Stadial 4 (39.5 kyr Bp), 
These events are synchronous with short-duration oscillations® in 
atmospheric CHy, suggesting that the abrupt shifts in both gases are 
synchronous with changes in tropical hydrology, for example, due to 
southward shifts in the position of the ITCZ. 

Rapid changes in terrestrial carbon storage are a potential expla- 
nation for these abrupt increases in CO2. Warming in the Northern 
Hemisphere at the onset of the Bolling period and end of the Younger 
Dryas (Box 2) could conceivably alter the carbon balance in terrestrial 
ecosystems. However, stable isotope data*® do not support this as a 
primary mechanism for the CO, increases associated with northern 
warming, and instead suggest that sea surface temperature change is 
probably at least part of the explanation. The 16.3-kyr CO; rise during 
HS1 is associated with a negative carbon isotopic excursion, implying 
the release of respired carbon**. Drying in the tropics and release of 
respired carbon associated with the southward migration of the ITCZ 
could explain this rapid CO, change and an analogous CO; change dur- 
ing HS4 and possibly HS5 (Fig. 2). This hypothesis is consistent with 
the suggestion that the small increases in atmospheric CH, registered at 
precisely the same time as the HS1 and HS4 CO) increases were caused 
by a southward shift of ITCZ rainfall®. 

At least one further aspect of Antarctic ice core data on short times- 
cales deserves mention—the record of volcanism provided by anom- 
alies in non-marine salt sulphate and the presence of volcanic debris 
(tephra). Ice core records provide highly detailed information about the 
eruption frequency needed to understand the role of volcanic forcing 
on climate. Much progress has been made recently in refining these 
records. For example, identification of synchronous volcanic sulphate 
deposition in the Antarctic and Greenland indicates tropical eruptions 
with the potential to affect climate globally**. Using the new WAIS 
Divide record and new data from Greenland, ref. *° identified five erup- 
tions in the last 2,500 years that were larger than the 1815 Tambora 
event. Volcanic events also provide critical stratigraphic links that are 
improving ice core timescales®”*®, a critical factor for making a unified 
Antarctic ice core chronology that will allow examination of regional 
differences in climate. 


The middle ground: orbital- millennial interaction 

Commonly, orbital-scale and millennial-scale climate change are 
treated and discussed in the literature as separate topics. For example, 
in their seminal paper on the bipolar seesaw, Stocker and Johnson?’ 
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duration, suggesting that the interstadial AMOC mode is unstable. In the 
middle, during intermediate climates such as MIS3, Dansgaard-Oeschger 
interstadials are frequent and of medium duration, resulting in high event 
frequency. On the right, during warm (high CO2) periods such as MIS 5, 
Dansgaard—Oeschger interstadials are of long duration, resulting in a 
lower event frequency, suggesting that the interstadial AMOC mode is 
very stable. Figure modified from ref. ® (Wiley). 


start by filtering out the orbital signal in order to investigate the abrupt 
Dansgaard—Oeschger events. Likewise, studies that seek to understand 
the link between ice volume and insolation (see ref. ! for example) do 
not always consider the influence of abrupt events. This approach is of 
course valid, and much has been learned by studying these timescales 
in isolation. However, capturing the full dynamics of the climate system 
requires consideration of the myriad ways in which millennial- and 
orbital-scale climate change interact, and influence each other. 

For example, the duration of Dansgaard—Oeschger intersta- 
dial phases scales strongly with the mean climate state®’. In Fig. 3a, 
Dansgaard—Oeschger duration is plotted against Antarctic temperature 
and CO}. During relatively warm, high-CO, periods such as MIS 5, 
Dansgaard-Oeschger interstadials tend to be long, suggesting a greater 
stability of the interstadial (or strong) AMOC mode; conversely, dur- 
ing cold, low-CO, periods such as MIS 2 and 4, Dansgaard—Oeschger 
interstadials are infrequent and of short duration, suggesting a weak 
interstadial mode that readily collapses (Fig. 3b). Early studies hypoth- 
esized a controlling influence of continental ice mass”?! but this is 
contradicted by global climate model simulations showing a stronger 
AMOC overturning upon increasing Laurentide ice volume”. Viable 
explanations for this state dependence (Fig. 3) include sea ice dynamics 
of the North Atlantic?**4, the state of the Southern Ocean®? and CO, 
levels (Fig. 3a)!*, none of which are mutually exclusive. 

Conversely, the bipolar seesaw appears to play an important part in 
the orbitally paced glacial cycle. As discussed above, data®® and mod- 
els°® suggest that atmospheric CO, levels, thought to be the global 
amplifier of the glacial cycles, are closely linked to (millennial-scale) 
changes in ocean circulation. Moreover, speleothem records indicate a 
strongly weakened East Asian monsoon during all of the last seven gla- 
cial terminations, interpreted as a southward shift in the ITCZ driven 
by North Atlantic cooling and AMOC cessation”. In this view, glacial 
terminations are simply the most powerful realizations of the (millennial- 
scale) Antarctic AIM events’. It thus appears that the bipolar seesaw 
has an important role in the machinery of glacial cycles, and should be 
considered in answering questions that are commonly placed in the 
‘orbital’ realm, such as the aforementioned problems of the 100-kyr 
cycle and the interhemispheric climate symmetry at the obliquity and 
precession timescales. 

The interdependence of orbital and millennial-scale climate change 
highlights the need to consider and develop theories that are appli- 
cable to both timescales. Transient climate model simulations of the 
last deglaciation that incorporate both orbital and millennial-scale 
AMOC forcings have been highly successful in fitting observations” 
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however, the freshwater fluxes were still prescribed rather than simu- 
lated. Similar coupled ocean-atmosphere climate model experiments 
are needed on all timescales, although the computational cost of such 
an endeavour is a challenge. 


Lessons for a warming world from Antarctic ice cores 
Perhaps the most fundamental message from ice cores is just how 
profound the anthropogenic impact on our atmospheric composition 
has been in the context of long-term natural variability. Levels of the 
primary greenhouse gases CO2, CHy and N20, all of which are directly 
affected by human emissions, are at higher levels than at any time***8? 
in the last 800 kyr. At the time of writing, concentrations of these three 
gases are elevated by about 45%, 155% and 22% over pre-industrial 
values, respectively. Modern changes in CO are also much more rapid 
than in the ice core record. The fastest pre-industrial increases in CO, 
were about 0.1 p.p.m. per year®4, compared to consistent recent growth 
rates close to 2 p.p.m. per year since the early 1990s””. For CHy, the 
modern growth rate is also faster than observed through most of the 
ice core record, although certain natural abrupt transitions match 
the speed of the current increase®*”. Nitrous oxide is not generally 
sampled closely enough to make such comparisons, but its past growth 
rate is unlikely to be as rapid as the current rise. The strong correlation 
of variations in these gases with climate proxies over the last 800 kyr 
verifies the importance of the greenhouse effect in global climate. As 
discussed above, the temperature records themselves confirm the the- 
ory of polar amplification’®, an important feature in future warming 
with implications for Arctic societies and wildlife. 

Recent work shows that rapid changes in climate and global bioge- 
ochemical cycles during the last ice age and deglaciation affected both 
the Southern Hemisphere and the tropics™7>***, with a strong imprint 
on tropical systems during Heinrich stadials*°. However, it is not clear 
what this tells us about what to expect in the future. On the one hand, 
much of the major millennial variability in the ice core record occurred 
under different (that is, glacial) boundary conditions, including lower 
sea level, larger ice sheets and considerably lower CO, levels. The lack 
of those conditions now suggests that in the near future completely 
analogous events are unlikely. On the other hand, there is a consider- 
able need to understand the strengths of fast feedbacks in the climate 
system, including the possibility of rapid mass loss at ice sheet margins, 
the possibility that Greenland meltwater runoff and surface warming 
could affect the Atlantic overturning circulation, and the potential for 
fast changes in the global carbon cycle. 

The ice core record suggests a more vigorous and stable AMOC 
during warmer background climate states (Fig. 3); if this palaeo- 
observation from the last ice age were to apply to future climates 
also (which has not been rigorously tested), it would imply that the 
short-term, transient AMOC weakening driven by freshening of the 
surface North Atlantic may in the long term be offset by an increase 
in equilibrium AMOC strength in a warmer world®?®. Furthermore, 
changes in the ITCZ and westerly winds appear to be part of the 
process that transmits millennial signals to Antarctica. It is therefore 
important to understand their dynamics and impacts better, considering 
the potential for future changes in regions currently supporting a major 
fraction of the global human population. 


The future of Antarctic ice core science 

The continuous ice core record to 800 kyr is a remarkable achievement. 
An extension of this record to earlier times, with a goal of reaching back 
to 1.5 million years Bp, isa major new international priority, with ongo- 
ing searches for appropriate sites”’. Several fundamental questions drive 
this quest. As discussed, ocean sediment records show that before the 
so-called Mid-Pleistocene Transition (1,200-800 kyr ago), global climate 
was dominated by a strong 41-kyr period, the cycle associated with the 
variations in Earth's tilt. After the Mid-Pleistocene Transition the quasi- 
100-kyr variability dominated. One fundamental question is whether 
global temperature was warmer at this time, perhaps resulting in smaller 
and more mobile ice sheets, and shorter ice age cycles”!. A second 
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question is whether Antarctic temperature before the Mid-Pleistocene 
Transition tracked the benthic isotope record, as it does in later times, or 
varied on some other timescale. A third question concerns whether 
changes in the long-term mean atmospheric concentrations of green- 
house gases play a part in changing the frequency of glaciation. 

The search for a suitable location at which to drill for very old ice 
involves major investments in radar remote sensing and rapid-access 
drills!°!-!°3 with which to test probable sites, and the development of 
better measurement methods. Several international groups are setting 
their sights on this goal, and it is likely that more than one record will be 
needed to confirm results. ‘Snapshots’ of older time periods can also be 
achieved by shallow drilling in ice margin regions’, with the potential 
for finding ice older than 2 million years!™. 

There is also much more to learn about long-term climate and bio- 
geochemical cycles in the existing 800,000-year record through more 
detailed measurements, including completion of long isotopic records 
for greenhouse gases and improved (volcanic) synchronization of ice 
cores both within Antarctica and from Greenland. Understanding the 
processes that lead to warm interglacials, the nature of abrupt change 
throughout the record, and the controls on greenhouse gas variability 
and its links to climate change are some obvious goals. The Holocene 
history of Antarctica is also critical. Although we have a fairly compre- 
hensive ice core view of the entire Holocene’”!, the data needed to 
put current global warming in the context of Antarctic changes could 
be improved!” Increasing the spatial coverage of high-quality, well 
dated records would add a great deal to our understanding of this issue. 

One of the largest questions about Antarctica for the near future 
concerns the stability of the West Antarctic Ice Sheet (WAIS). This ice 
sheet is believed to be vulnerable to collapse as it is grounded below 
sea level with a retrograde bedrock slope. A critical question that ice 
core science might answer is whether the ice sheet collapsed during 
the last interglacial, when temperatures and sea level were higher than 
today. No cores in the present WAIS penetrate this period, but given 
the limited number of drilling projects and the high basal melt rate in 
the area, this cannot be taken as evidence that the WAIS was not there. 
Modelling suggests that WAIS collapse could be recorded by distinctive 
patterns in ice core temperature proxies adjacent to the WAIS, related to 
altered atmospheric circulation!”’. The Mount Moulton blue ice site, at 
2,820 m in Marie Byrd Land!%, provides the only interglacial stable iso- 
tope record from West Antarctica; the data are consistent with collapse 
scenarios’”’. Given uncertainties in models and interpretations of blue 
ice sites, and the appropriate boundary conditions for collapse scenar- 
ios, this result is by no means definitive. Additional coring in and next 
to the WAIS will be needed to provide better constraints—sites iden- 
tified for this purpose include Hercules Dome and Skytrain Ice Rise. 

Advancing understanding and measurement of ice core proxies will 
continue to be important. New techniques, for example, deep-ocean 
temperature proxies from noble gases, ‘clumped isotopes’ in atmos- 
pheric gases, mass-independent isotope fractionation in water, sul- 
phate, oxygen and other systems, water isotope diffusion, palaeo-data 
assimilation techniques, better measurements and more detail in heavy 
isotope proxies of dust sources, continuous, centimetre-scale water iso- 
tope and gas analysis, more detailed isotopic measurements of green- 
house gases, and a host of other methods hold promise for revealing 
much more about Antarctic and global environmental change. 

Finally, although there are ice core records throughout Antarctica 
(Box 1), the overall spatial coverage is in fact quite thin considering the 
size and complexity of the continent, and new drilling will continue to 
be needed to advance scientific goals. International cooperation in ice 
core drilling and prioritizing science is strong. The technical exper- 
tise required to accomplish challenging field programmes is available 
in many nations, placing ice core science in an excellent position to 
improve our understanding of the history of Antarctica and its links to 
the larger Earth system. 


Received: 10 November 2017; Accepted: 19 March 2018; 
Published online 13 June 2018. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


22. 


23. 


24. 


25. 
26. 


27. 


28. 


29. 


30. 


31. 


32. 


Galeotti, S. et al. Antarctic Ice Sheet variability across the Eocene-Oligocene 
boundary climate transition. Science 352, 76-80 (2016). 

Flower, B. P. & Kennett, J. P. Relations between Monterey Formation deposition 
and middle Miocene global cooling: Naples Beach section. Calif. Geol. 21, 
877-880 (1993). 

Higgins, J. A. et al. Atmospheric composition 1 million years ago from blue ice 
in the Allan Hills, Antarctica. Proc. Natl Acad. Sci. USA 112, 6887-6891 (2015). 
This study reports the first greenhouse gas data from ice older than 800,000 
years. 

Jouzel, J. et al. Validity of the temperature reconstruction from water isotopes 
in ice cores. J. Geophys. Res. Oceans 102, 26471-26487 (1997). 

Wolff, E., Fischer, H. & Rothlisberger, R. Glacial terminations as southern 
warmings without northern control. Nat. Geosci. 2, 206-209 (2009). 
Schupbach, S. et al. High-resolution mineral dust and sea ice proxy records 
from the Talos Dome ice core. Clim. Past 9, 2789-2807 (2013). 

McConnell, J. R. et al. Antarctic-wide array of high-resolution ice core records 
reveals pervasive lead pollution began in 1889 and persists today. Sci. Rep. 4, 
5848 (2014). 

Brook, E. J., Kurz, M. D. & Curtice, J. Flux and size fractionation of He in 
interplanetary dust from Antarctic ice core samples. Earth Planet. Sci. Lett. 286, 
565-569 (2009). 

Van Ommen, T. D., Morgan, V. & Curran, M. A. Deglacial and Holocene changes 
in accumulation at Law Dome, East Antarctica. Ann. Glaciol. 39, 359-365 
(2004). 

Cuffey, K. M. et al. Deglacial temperature history of West Antarctica. Proc. Nat! 
Acad. Sci. USA 113, 14249-14254 (2016). 

This work provides the first accurate borehole-based temperature 
reconstruction from Antarctica, indicating a glacial-interglacial temperature 
change of 11.3 + 1.8 °C. 

Jouzel, J. et al. Orbital and millennial Antarctic climate variability over the past 
800,000 years. Science 317, 793-796 (2007). 

This paper reports the full 800,000-year temperature reconstruction from 
the EPICA Dome C ice core, the longest such record. 

Dome Fuji Project Members. State dependence of climatic instability over the 
past 720,000 years from Antarctic ice cores and climate modeling. Sci. Adv. 3, 
e1600446 (2017). 

Petit, J. R. et al. Climate and atmospheric history of the past 420,000 years 
from the Vostok ice core, Antarctica. Nature 399, 429-436 (1999). 

Lisiecki, LE. & Raymo, M. E. A Pliocene-Pleistocene stack of 57 globally 
distributed benthic §!80 records. Paleoceanography 20, 1-17 (2005). 

Shakun, J. D. et al. Global warming preceded by increasing carbon dioxide 
concentrations during the last deglaciation. Nat. Geosci. 484, 49-55 (2012). 
Hays, J. D., Imbrie, J. & Shackleton, N. J. Variations in the Earth's orbit: 
pacemaker of the ice ages. Science 194, 1121-1132 (1976). 

Imbrie, J. et al. On the structure and origin of major glaciation cycles. 
Paleoceanogr. Paleoclimatol. 8, 699-735 (1993). 

Raymo, M. The timing of major climate terminations. Paleoceanogr. 
Paleoclimatol. 12, 577-585 (1997). 

Paillard, D. The timing of Pleistocene glaciations from a simple multiple-state 
climate model. Nature 391, 378-381 (1998). 

Cheng, H. et al. The Asian monsoon over the past 640,000 years and ice age 
terminations. Nature 534, 640-646 (2016). 

Tzedakis, P., Crucifix, M., Mitsui, T. & Wolff, E. W. A simple rule to determine 
which insolation cycles lead to interglacials. Nature 542, 427-432 (2017). 
Bintanja, R. & Van de Wal, R. North American ice-sheet dynamics and the 
onset of 100,000-year glacial cycles. Nature 454, 869-872 (2008). 
Abe-Ouchi, A. et al. Insolation-driven 100,000-year glacial cycles and 
hysteresis of ice-sheet volume. Nature 500, 190-193 (2013). 

Ganopolski, A. & Brovkin, V. Simulation of climate, ice sheets and CO2 
evolution during the last four glacial cycles with an Earth system model of 
intermediate complexity. Clim. Past 13, 1695-1716 (2017). 

Broecker, W. S. & Denton, G. H. The role of ocean-atmosphere reorganizations 
in glacial cycles. Quat. Sci. Rev. 9, 305-341 (1990). 

Kawamura, K. et al. Northern Hemisphere forcing of climatic cycles in 
Antarctica over the past 360,000 years. Nature 448, 912-916 (2007). 

This study links variations in the O2/N2 ratio of trapped air in the Dome Fuji 
ice core to local summer insolation, thereby dating the core and showing 
that glacial terminations in Antarctica closely followed Northern 
Hemisphere summer insolation. 

Stocker, T. F. & Johnsen, S. J. A minimum thermodynamic model for the 
bipolar seesaw. Paleoceanogr. Paleoclimatol. 18, https://doi. 
org/10.1029/2003PA000920 (2003). 

Huybers, P. & Denton, G. Antarctic temperature at orbital timescales controlled 
by local summer duration. Nat. Geosci. 1, 787-792 (2008). 

WAIS Divide Project Members. Onset of deglacial warming in West Antarctica 
driven by local orbital forcing. Nature 500, 440-444 (2013). 

Loulergue, L. et al. Orbital and millennial-scale features of atmospheric CH4 
over the past 800,000 years. Nature 453, 383-386 (2008). 

This paper reports the full 800,000-year atmospheric methane record from 
the EPICA Dome C ice core, showing variations on orbital and millennial 
timescales. 

Yin, Q. Insolation-induced mid-Brunhes transition in Southern Ocean 
ventilation and deep-ocean temperature. Nature 494, 222-225 (2013). 

Wolff, E. W. et al. Southern Ocean sea-ice extent, productivity and iron flux over 
the past eight glacial cycles. Nature 440, 491-496 (2006). 

Chemical measurements from the EPICA Dome C ice core indicate that both 
the flux of iron (from wind-blown dust) and sea ice extent increased during 
glacial periods over the past 740,000 years. 


33. 


34. 


35. 


36. 


37. 
38. 


39. 


40. 


41. 


42. 


43. 


44. 


45. 


46. 


47. 


48. 


49. 


50. 


51. 


52. 


53. 


54. 


55. 


56. 


57. 


58. 


59. 
60. 


61. 


62. 


63. 


64. 


REVIEW 


Lambert, F. et al. Dust-climate couplings over the past 800,000 years from the 
EPICA Dome C ice core. Nature 452, 616-619 (2008). 

Fischer, H., Siggaard-Andersen, M. L., Ruth, U., Rothlisberger, R. & Wolff, E. 
Glacial/interglacial changes in mineral dust and sea-salt records in polar ice 
cores: sources, transport, and deposition. Rev. Geophys. 45, 1-26 (2007). 
Martinez-Garcia, A. et al. Links between iron supply, marine productivity, sea 
surface temperature, and COz over the last 1.1 Ma. Paleoceanogr. 
Paleoclimatol. 24, PA1207 (2009). 

Jaccard, S. et al. Two modes of change in Southern Ocean productivity over 
the past million years. Science 339, 1419-1423 (2013). 

Abram, N. J., Wolff, E. W. & Curran, M.A. A review of sea ice proxy information 
from polar ice cores. Quat. Sci. Rev. 79, 168-183 (2013). 

Luthi, D. et al. High-resolution carbon dioxide concentration record 
650,000-800,000 years before present. Nature 453, 379-382 (2008). 

This paper completed the 800,000-year-long EPICA Dome C CO> record 
shown in Fig. 1, demonstrating the variation of CO. maxima during 
interglacial times. 

Schilt, A. et al. Glacial-interglacial and millennial-scale variations in the 
atmospheric nitrous oxide concentration during the last 800,000 years. Quat. 
Sci. Rev. 29, 182-192 (2010). 

Sigman, D. M. & Boyle, E. A. Glacial/interglacial variations in atmospheric 
carbon dioxide. Nature 407, 859-869 (2000). 

Sigman, D. M., Hain, M. P. & Haug, G. H. The polar ocean and glacial cycles in 
atmospheric COz concentration. Nature 466, 47-55 (2010). 

Stephens, B. B. & Keeling, R. F. The influence of Antarctic sea ice on 
glacial-interglacial CO2 variations. Nature 404, 171-174 (2000). 

Skinner, L., Fallon, S., Waelbroeck, C., Michel, E. & Barker, S. Ventilation of the 
deep Southern Ocean and deglacial COz rise. Science 328, 1147-1151 (2010). 
Ferrari, R. et al. Antarctic sea ice control on ocean circulation in present and 
glacial climates. Proc. Nat! Acad. Sci. USA 111, 8753-8758 (2014). 

Martin, J. H. Glacial-interglacial CO2 change: the iron hypothesis. Paleoceanogr. 
Paleoclimatol. 5, 1-13 (1990). 

Franois, R. et al. Contribution of Southern Ocean surface-water stratification to 
low atmospheric CO2 concentrations during the last glacial period. Nature 
389, 929-935 (1997). 
Anderson, R. et al. Wind-driven upwelling in the Southern Ocean and the 
deglacial rise in atmospheric CO2. Science 323, 1443-1448 (2009). 

Bauska, T. K. et al. Carbon isotopes characterize rapid changes in atmospheric 
carbon dioxide during the last deglaciation. Proc. Natl Acad. Sci. USA 113, 
3465-3470 (2016). 

Galbraith, E. & Eggleston, S. A lower limit to atmospheric COz concentrations 
over the past 800,000 years. Nat. Geosci. 10, 295-298 (2017). 

Brook, E. J., Sowers, T. & Orchardo, J. Rapid variations in atmospheric methane 
concentration during the past 110,000 years. Science 273, 1087-1091 (1996). 
Petrenko, V. V. et al. Minimal geological methane emissions during the Younger 
Dryas-Preboreal abrupt warming event. Nature 548, 443-446 (2017). 

Sowers, T. Late quaternary atmospheric CH, isotope record suggests marine 
clathrates are stable. Science 311, 838-840 (2006). 

Bock, M. et al. Hydrogen isotopes preclude marine hydrate CH, emissions at 
the onset of Dansgaard-Oeschger events. Science 328, 1686-1689 (2010). 
Levine, J. et al. Reconciling the changes in atmospheric methane sources and 
sinks between the Last Glacial Maximum and the pre-industrial era. Geophys. 
Res. Lett. 38, L23804 (2011). 

urray, L. T. et al. Factors controlling variability in the oxidative capacity of the 
troposphere since the Last Glacial Maximum. Atmos. Chem. Phys. 14, 
3589-3622 (2014). 

Fischer, H. et al. Changing boreal methane sources and constant biomass 
burning during the last termination. Nature 452, 864-867 (2008). 

Brook, E., Archer, D., Dlugokencky, E., Frolking, S. & Lawrence, D. in Abrupt 
Climate Change. A report by the U.S. Climate Change Science Program and the 
Subcommittee on Global Change Research (ed. McGeehin, J. P.) Ch. 5, 163-201 
(US Geological Survey, Reston, 2008). 

Schmittner, A. & Galbraith, E. D. Glacial greenhouse-gas fluctuations controlled 
by ocean circulation changes. Nature 456, 373-376 (2008). 

Schilt, A. et al. Isotopic constraints on marine and terrestrial N20 emissions 
during the last deglaciation. Nature 516, 234-237 (2014). 

Severinghaus, J. P., Beaudette, R., Headly, M.A., Taylor, K. & Brook, E. J. 
Oxygen-18 of O2 records the impact of abrupt climate change on the 
terrestrial biosphere. Science 324, 1431-1434 (2009). 

Stolper, D., Bender, M., Dreyfus, G., Yan, Y. & Higgins, J. A Pleistocene ice core 
record of atmospheric O2 concentrations. Science 353, 1427-1430 (2016). 
Bender, M. L., Barnett, B., Dreyfus, G., Jouzel, J. & Porcelli, D. The contemporary 
degassing rate of 4°Ar from the solid Earth. Proc. Nat! Acad. Sci. USA 105, 
8232-8237 (2008). 

Precise measurements of argon isotope ratios in trapped air are used in this 
study to develop a chronometer for old ice based on the accumulation of 4°Ar 
in the atmosphere from “°K decay in the crust. 

Headly, M. A. & Severinghaus, J. P. A method to measure Kr/Nz ratios in air 
bubbles trapped in ice cores and its application in reconstructing past mean 
ocean temperature. J. Geophys. Res. 112, D19105 (2007). 

Bereiter, B., Shackleton, S., Baggenstos, D., Kawamura, K. & Severinghaus, J. 
Mean global ocean temperatures during the last glacial transition. Nature 553, 
39-44 (2018). 

Measurements of Kr, Xe, Ar, and N are used in this study to make the first 
precise estimates of changes in global deep-ocean temperature across 

the last glacial-interglacial transition, showing that these are mostly 
synchronous with changes in Antarctic air temperature and atmospheric 
CO. 


14 JUNE 2018 | VOL 558 | NATURE | 207 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


REVIEW 


65. 
66. 


67. 
68. 


69. 
70. 
71. 
72. 


73. 


74. 
75. 
76. 
77. 


78. 


79: 
80. 
81. 


82. 


83. 
84. 


85. 
86. 
87. 


88. 
89. 
90. 
91. 


92. 
93. 


Grootes, P,, Stuiver, M., White, J., Johnsen, S. & Jouzel, J. Comparison of oxygen 
isotope records from the GISP2 and GRIP Greenlandice cores. Nature 366, 
552-554 (1993). 

Dansgaard, W. et al. Evidence for general instability of past climate from a 
250-kyr ice-core record. Nature 364, 218-220 (1993). 

This paper reports the first detailed record of abrupt changes in temperature 
in Greenland. 

Andersen, K. K. et al. High-resolution record of Northern Hemisphere climate 
extending into the last interglacial period. Nature 431, 147-151 (2004). 
Blunier, T. & Brook, E. J. Timing of millennial-scale climate change in Antarctica 
and Greenland during the last glacial period. Science 291, 109-112 (2001). 
This paper used methane variations to make a common timescale for ice 
cores in Greenland and Antarctica, and showed the bi-polar seesaw pattern 
for the larger climate variations during the last ice age. 

Brook, E. J. et al. Timing of millennial-scale climate change at Siple Dome, 
West Antarctica, during the last glacial period. Quat. Sci. Rev. 24, 1333-1343 
(2005). 

EPICA Community Members. One-to-one coupling of glacial climate variability 
in Greenland and Antarctica. Nature 444, 195-198 (2006). 

This paper demonstrated that all abrupt climate events in the Greenland 
record have counterparts in Antarctica. 

Stenni, B. et al. Expression of the bipolar see-saw in Antarctic climate records 
during the last deglaciation. Nat. Geosci. 4, 46-49 (2011). 

Landais, A. et al. A review of the bipolar see-saw from synchronized and high 
resolution ice core water stable isotope records from Greenland and East 
Antarctica. Quat. Sci. Rev. 114, 18-32 (2015). 

WAIS Divide Project Members. Precise interpolar phasing of abrupt climate 
change during the last ice age. Nature 520, 661-665 (2015). 

Using very precise chronological constraints, this paper demonstrated that 
millennial-scale warming and cooling in Antarctica lagged counterpart 
events in Greenland by about 200 years on average. 

Bender, M. et al. Climate correlations between Greenland and Antarctica 
during the past 100,000 years. Nature 372, 663-666 (1994). 

Broecker, W. S. Paleocean circulation during the last deglaciation: a bipolar 
seesaw? Paleoceanogr. Paleoclimatol. 13, 119-121 (1998). 

Vellinga, M. & Wood, R. A. Global climatic impacts of a collapse of the Atlantic 
thermohaline circulation. Clim. Change 54, 251-267 (2002). 

Schmittner, A., Saenko, O. & Weaver, A. Coupling of the hemispheres in 
observations and simulations of glacial climate change. Quat. Sci. Rev. 22, 
659-671 (2003). 

Stouffer, R. J. et al. Investigating the causes of the response of the 
hermohaline circulation to past and future climate changes. J. Clim. 19, 
1365-1387 (2006). 

arkle, B. R. et al. Global atmospheric teleconnections during Dansgaard- 
Oeschger events. Nat. Geosci. 10, 36-40 (2017). 

asson-Delmotte, V. et al. Abrupt change of Antarctic moisture origin at the 
end of Termination Il. Proc. Nat! Acad. Sci. USA 107, 12091-12094 (2010). 
Lambert, F., Bigler, M., Steffensen, J. P., Hutterli, M. & Fischer, H. Centennial 
mineral dust variability in high-resolution ice core data from Dome C, 
Antarctica. Clim. Past 8, 609-623 (2012). 

enviel, L., England, M. H., Meissner, K., Mouchet, A. & Yu, J. Atlantic-Pacific 
seesaw and its role in outgassing CO2 during Heinrich events. Paleoceanogr. 
Paleoclimatol. 29, 58-70 (2014). 

Ahn, J., Brook, E. J., Schmittner, A. & Kreutz, K. Abrupt change in atmospheric 
COz during the last ice age. Geophys. Res. Lett. 39, GLO53018 (2012). 

Marcott, S. A. et al. Centennial-scale changes in the global carbon cycle during 
the last deglaciation. Nature 514, 616-619 (2014). 

The most detailed CO, record for the deglaciation to date is reported from 
the WAIS Divide ice core in this paper, showing the tight coupling of CO2 and 
Antarctic climate (Box 2). 

Rhodes, R. H. et al. Enhanced tropical methane production in response to 
iceberg discharge in the North Atlantic. Science 348, 1016-1019 (2015). 

Sigl, M. et al. Timing and climate forcing of volcanic eruptions for the past 
2,500 years. Nature 523, 543-549 (2015). 

Veres, D. et al. The Antarctic ice core chronology (AICC2012): an optimized 
multi-parameter and multi-site dating approach for the last 120 thousand 
years. Clim. Past 9, 1733-1748 (2013). 

Bazin, L. et al. An optimized multi-proxy, multi-site Antarctic ice and gas orbital 
chronology (AICC2012): 120-800 ka. Clim. Past Discuss. 8, 5963-6009 
(2012). 

Buizert, C. & Schmittner, A. Southern Ocean control of glacial AMOC stability 
and Dansgaard-Oeschger interstadial duration. Paleoceanogr. Paleoclimatol. 
30, 1595-1612 (2015). 

McManus, J. F., Oppo, D. W. & Cullen, J. L. A. 0.5-million-year record of 
millennial-scale climate variability in the North Atlantic. Science 283, 971-975 
(1999). 

Schulz, M., Berger, W. H., Sarnthein, M. & Grootes, P.M. Amplitude variations of 
1470-year climate oscillations during the last 100,000 years linked to 
fluctuations of continental ice mass. Geophys. Res. Lett. 26, 3385-3388 (1999). 
Muglia, J. & Schmittner, A. Glacial Atlantic overturning increased by wind 
stress in climate models. Geophys. Res. Lett. 42, 9862-9868 (2015). 

Oka, A., Hasumi, H. & Abe-Ouchi, A. The thermal threshold of the Atlantic 
meridional overturning circulation and its control by wind stress forcing 
during glacial climate. Geophys. Res. Lett. 39,GLO51421 (2012). 


208 | NATURE | VOL 558 | 14 JUNE 2018 
© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


94. Wang, Z. & Mysak, L. A. Glacial abrupt climate changes and Dansgaard- 
Oeschger oscillations in a coupled climate model. Paleoceanogr. Paleoclimatol. 
21, PAO01238 (2006). 

95. Ahn, J. & Brook, E. J. Atmospheric COz2 and climate on millennial time scales 
during the last glacial period. Science 322, 83-85 (2008). 

96. Liu, Z. etal. Transient simulation of last deglaciation with a new mechanism for 
Balling-Allerad warming. Science 325, 310-314 (2009). 

97. Dlugokencky, E. Trends in Atmospheric Methane. http://www.esrl.noaa.gov/ 
gmd/ccgg/trends_ch4/ (Earth System Research Laboratory, 2018). 

98. Togegweiler, J. & Russell, J. Ocean circulation in a warming climate. Nature 451, 
286-288 (2008). 

99. Fischer, H. et al. Where to find 1.5 million yr old ice for the IPICS “Oldest-Ice” 
ice core. Clim. Past 9, 2489-2505 (2013). 

00. Raymo, M., Lisiecki, L. & Nisancioglu, K. H. Plio-Pleistocene ice volume, 
Antarctic climate, and the global §!80 record. Science 313, 492-495 (2006). 

01. Schwander, J., Marending, S., Stocker, T. & Fischer, H. RADIX: a minimal- 
resources rapid-access drilling system. Ann. Glaciol. 55, 34-38 (2014). 

02. Alemany, O. et al. The SUBGLACIOR drilling probe: concept and design. 

Ann. Glaciol. 55, 233-242 (2014). 

03. Goodge, J. W. & Severinghaus, J. P. Rapid Access Ice Drill: a new tool for 
exploration of the deep Antarctic ice sheets and subglacial geology. J. Glaciol. 
62, 1049-1064 (2016). 

04. Yan, Y. N. J. et al. 2.7-Million-Year-Old Ice from Allan Hills Blue Ice Areas, 

East Antarctica Reveals Climate Snapshots Since Early Pleistocene. 
Goldschmidt Conf. (Paris, France) 4359, https://goldschmidtabstracts. 
info/2017/4359.pdf (European Association of Geochemistry and the 
Geochemical Society, 2007). 

05. Masson-Delmotte, V. et al. A comparison of the present and last interglacial 
periods in six Antarctic ice cores. Clim. Past 7, 397-423 (2011). 

06. Pages 2k Consortium. Continental-scale temperature variability during the 
past two millennia. Nat. Geosci. 6, 339-350 (2013). 

07. Steig, E. J. et al. Influence of West Antarctic Ice Sheet collapse on Antarctic 
surface climate. Geophys. Res. Lett. 42, 4862-4868 (2015). 

08. Korotkikh, E. V. et al. The last interglacial as represented in the glaciochemical 
record from Mount Moulton Blue Ice Area, West Antarctica. Quat. Sci. Rev. 30, 
1940-1947 (2011). 

09. Bereiter, B. et al. Revision of the EPICA Dome C COz record from 800 to 

600 kyr before present. Geophys. Res. Lett. 42, 542-549 (2015). 

0. Bereiter, B. et al. Mode change of millennial CO2 variability during the last 
glacial cycle associated with a bipolar marine carbon seesaw. Proc. Natl Acad. 
Sci. USA 109, 9755-9760 (2012). 

1. Gow, A. J., Ueda, H. T. & Garfield, D. E. Antarctic ice sheet: preliminary results of 
first core hole to bedrock. Science 161, 1011-1013 (1968). 

2. EPICA Community Members. Eight glacial cycles from an Antarctic ice core. 
Nature 429, 623-628 (2004). 

3. Slawny, K. R. et al. Production drilling at WAIS Divide. Ann. Glaciol. 55, 147-155 
(2014). 

114. Neftel, A., Oeschger, H., Staffelbach, T. & Stauffer, B. CO2 record in the Byrd ice 
core 50,000-5,000 years BP. Nature 331, 609-611 (1988). 

5. Barnola, J. M., Pimienta, P., Raynaud, D. & Korotkevich, Y. S. CO2-climate 
relationship as deduced from the Vostok Ice Core—a reexamination based on 
new measurements and on a reevaluation of the air dating. Tellus B 43, 83-90 
(1991). 

6. Fischer, H., Wahlen, M., Smith, J., Mastroianni, D. & Deck, B. Ice core records of 
atmospheric CO2 around the last three glacial terminations. Science 283, 
1712-1714 (1999). 

7. Monnin, E. et al. Atmospheric COz2 concentrations over the last glacial 
termination. Science 291, 112-114 (2001). 

8. Pedro, J. B., Rasmussen, S. O. & van Ommen, T. D. Tightened constraints on 
the time-lag between Antarctic temperature and COz during the last 
deglaciation. Clim. Past 8, 1213-1221 (2012). 

9. Parrenin, F. et al. Synchronous change of atmospheric CO2 and Antarctic 
temperature during the last deglacial warming. Science 339, 1060-1063 
(2013). 


Acknowledgements We thank J. Pedro for comments that improved the 
manuscript. The US National Science Foundation and US Antarctic Program 
have provided support for our research and acquisition of Antarctic ice cores 
that we have studied; we thank them, as well as numerous international 
agencies and colleagues who have contributed to ice core science. 


Reviewer information Nature thanks J. Pedro and the other anonymous 
reviewer(s) for their contribution to the peer review of this work. 


Author contributions The authors contributed equally to this work. 
Competing interests The authors declare no competing interests. 


Additional information 

Reprints and permissions information is available at http://www.nature.com/ 
reprints. 

Correspondence and requests for materials should be addressed to E.J.B. 
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional 
claims in published maps and institutional affiliations. 


REVIEW 


https://doi.org/10.1038/s41586-018-0182-3 


The global influence of localized 
dynamics in the Southern Ocean 


Stephen R. Rintoul!* 


The circulation of the Southern Ocean connects ocean basins, links the deep and shallow layers of the ocean, and has a 
strong influence on global ocean circulation, climate, biogeochemical cycles and the Antarctic Ice Sheet. Processes that 
act on local and regional scales, which are often mediated by the interaction of the flow with topography, are fundamental 
in shaping the large-scale, three-dimensional circulation of the Southern Ocean. Recent advances provide insight into 
the response of the Southern Ocean to future change and the implications for climate, the carbon cycle and sea-level rise. 


(56° S-58° S), unblocked by continents. As a consequence of this 

unique geometry, the Southern Ocean influences the ocean cir- 
culation and climate on global scales. The multiple jets of the Antarctic 
Circumpolar Current (ACC), the largest ocean current, flow from west 
to east through this channel and connect the ocean basins (Fig. 1a). 
The unblocked channel thus enhances interbasin exchange, but inhib- 
its north-south exchange because there can be no net meridional geo- 
strophic flow in the absence of zonal pressure gradients supported by 
land barriers or topography. Surfaces of constant density rise steeply to 
the south, in geostrophic balance with the strong eastward flow of the cir- 
cumpolar current. The steeply sloping isopycnals (see Box 1 for a glossary 
of terms used) provide a reservoir of potential energy that is extracted 
by baroclinic instability to drive the vigorous eddy field that is evident 
in Fig. la. Eddies, in turn, transport fluid and tracers across the ACC. 
In particular, deep water spreads polewards and rises along the sloping 
isopycnals, reaching the sea surface near Antarctica’ (Fig. 1b). Vigorous 
interactions between the atmosphere, ocean and cryosphere transform 
upwelled deep waters into either dense Antarctic Bottom Water (AABW) 
or lighter intermediate waters (Subantarctic Mode Water (SAMW) or 
Antarctic Intermediate Water (AAIW)). The result of these water-mass 
transformations is an overturning circulation that consists of two cells: an 
upper cell in which dense deep water is converted to lighter waters, and 
a lower cell in which deep water is converted to denser bottom water!4 
(Fig. 1b). The poleward flow of deep water is balanced by equatorward 
flow of intermediate and bottom waters formed in the Southern Ocean. 
By connecting the ocean basins and the deep and shallow layers of the 
ocean, the Southern Ocean circulation allows a global-scale ocean over- 
turning in which conversion of deep water to intermediate water in the 
Southern Ocean largely compensates the sinking of deep water in the 
North Atlantic. The global overturning circulation, in turn, largely sets 
the capacity of the ocean to store and transport heat and carbon dioxide 
and thereby influence climate**. 

Several characteristics set the circulation of the Southern Ocean 
apart from ocean currents in other regions. Weak stratification and 
strong eastward flow driven by powerful winds over the Southern 
Ocean combine to establish momentum and vorticity balances that 
differ from those at lower latitudes. In particular, the adjustment pro- 
cess that limits the depth of the wind-driven circulation at lower lati- 
tudes does not operate in the ACC”. In the subtropics, changes in wind 
generate planetary waves that propagate west and gradually establish a 


T he ocean circles the globe in the latitude band of Drake Passage 


new equilibrium circulation of the gyres found in the upper kilometre 
or so of the water column”. In the ACC, the eastward flow is much 
faster than the westward propagation of the waves. As a consequence, 
the current extends to great depth and the flow is strongly influenced 
by topography. 

Substantial progress in understanding the dynamics of the Southern 
Ocean and its role in the climate system has been made by adopting 
the zonally averaged perspective shown schematically in Fig. 1b. Here 
I describe new insights gained from observations, theory and models 
that highlight the limitations of the zonal-mean view and the impor- 
tance of local and regional dynamics. (‘Local’ in this context refers to 
processes that are initiated in a particular area, over spatial scales of 
tens to hundreds of kilometres, but whose influence on the flow may 
extend hundreds or thousands of kilometres downstream.) Recent 
studies have also elucidated the pathways responsible for sequestering 
heat and carbon and their unanticipated sensitivity to fluctuations in 
climate forcing. The processes that drive the overturning circulation are 
better understood, including appreciation of the contribution of fresh- 
water transport by sea-ice formation and melt. The extent to which the 
fate of the Antarctic Ice Sheet is linked to changes in the surrounding 
ocean has come into sharper focus in the past decade. As observa- 
tional records have increased in length and coverage, the nature and 
drivers of variability and change in the Southern Ocean have become 
better understood. Taken together, these recent developments provide 
the foundation for a new conceptual model of the Southern Ocean, in 
which the large-scale circulation that is of such importance to global 
climate emerges from dynamics that play out on local and regional 
scales, catalysed by topography. 


Zonal-average dynamics of the Southern Ocean 

As illustrated in Fig. 1, the circulation of the Southern Ocean con- 
sists of two main elements: the strong eastward flow of the ACC and 
a weaker overturning circulation that carries water towards or away 
from the Antarctic continent. By the early 2000s, sufficient progress 
had been made to allow articulation of a dynamical recipe for these two 
dominant aspects of the Southern Ocean circulation and their interac- 
tion, as summarized in recent reviews!'-!3. The essential ingredients 
include a circumpolar channel with realistic bathymetry, driven at the 
surface by wind and buoyancy forcing. Strong westerly winds drive 
the northward transport of surface waters in the Ekman layer, with 
convergence (downwelling) north of the wind-stress maximum and 
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Fig. 1 | The Southern Ocean circulation. a, Five-day-mean current speed 
at a depth of 112 m from a high-resolution (1/12°) numerical simulation 
of the ACC, illustrating the filamented, eddy-rich structure of the current. 
Image adapted from ref. !!° (movie $1 in the supporting information), 
American Geophysical Union. b, A highly schematic illustration of the 
Southern Ocean overturning circulation. Deep water spreads polewards 
and upwards across the ACC. Water that upwells close to Antarctica 

is converted to denser bottom water by cooling and brine released 

during sea-ice formation (indicated by white dots in the upper right of 

the diagram). Water that upwells further north is converted to lighter 
mode and intermediate waters that sink to the north of the current and 
ventilate the intermediate depths of the ocean. The schematic omits many 
important aspects of the Southern Ocean circulation, including wind and 
buoyancy forcing, eddies and mixing processes. Image adapted from ref. 1! 
(https://doi.org/10.26749/rstpp.133.3.41, figure 6), Royal Society of Tasmania. 


divergence (upwelling) south of it’’. The winds therefore cause isopyc- 


nals to slope upwards to the south, establishing an eastward geostrophic 
current. Once the isopycnals are sufficiently steep, they become baro- 
clinically unstable and form eddies'”. The vigorous eddy field plays a 
central part in Southern Ocean dynamics. Eddies transfer momentum 
vertically from the sea surface to the sea floor!!!”, where bottom-form 
stress balances the wind stress'+. Downward flux of momentum by 
eddies is associated with poleward transport of heat, and eddies make 
the dominant contribution to the poleward heat flux that is needed to 
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balance the heat lost to the atmosphere at high latitudes!°. Buoyancy 
forcing can also influence ACC transport by altering the stratification 
and cross-stream density gradient!”'®. 

Although early measurements of temperature and salinity were suf- 
ficient to reveal the existence of an overturning circulation that carries 
saline deep water towards Antarctica and a return flow of fresher waters 
near the sea floor and in the upper ocean!” (Fig. 1b), the dynamics of 
the overturning and its connection to the ACC remained obscure for 
many decades. Progress in the late twentieth century revealed that the 
overturning circulation is in fact intimately linked to the dynamics 
of the ACC and its eddy field. Because eddies carry mass polewards, 
allowing meridional transport across the unbounded channel of the 
Southern Ocean at depths above the shallowest topography, the eddy 
field associated with the unstable flow of the ACC is connected directly 
to the overturning circulation. In a stably stratified ocean, a zonal-mean 
overturning circulation can exist only if buoyancy is added or removed 
to convert water from one density class to another (for example, con- 
version of dense deep water to lighter mode and intermediate water in 
the upper cell requires an input of buoyancy, whereas conversion of 
deep water to denser bottom water in the lower cell requires removal of 
buoyancy). As a consequence, the strength of the overturning is related 
directly to the buoyancy forcing at the sea surface*"’. 

A key question is how the circulation of the Southern Ocean, both 
the ACC and the overturning, responds to changes in forcing by the 
atmosphere (that is, changes in wind stress at the sea surface or in air- 
sea exchange of heat and moisture). Two concepts have dominated 
recent discussion of the response of the Southern Ocean to changes in 
forcing: ‘eddy saturation’ and ‘eddy compensation: In the eddy-satura- 
tion limit, stronger wind forcing results in a more vigorous eddy field, 
with little change in ACC transport'®. Observations of little change in 
the isopycnal slope across the ACC despite increasing westerly winds 
have been taken as evidence that the ACC is close to an eddy-saturation 
regime’”, as also seen in eddy-resolving numerical simulations”? ~”. 
Eddy compensation refers to the tendency for eddy mass transports to 
counter the wind-driven overturning circulation. In the limit of com- 
plete eddy compensation, the increase in northward Ekman transport 
in response to stronger winds is balanced by increased southward 
eddy mass transport. Eddy-resolving numerical simulations suggest 
that eddies only partially compensate the wind-driven circulation?” ””. 
In addition, eddy fluxes and Ekman transport act at different depths 
and transport different water masses”>. 

In summary, substantial progress in the past two decades established 
a conceptual framework for the Southern Ocean, in which both wind 
and buoyancy forcing drive the circulation, the ACC and overturn- 
ing are dynamically intertwined, eddies have a key role in establishing 
zonal-average dynamical balances, and interaction of flow with the 
sea floor balances forcing at the sea surface. However, this model fell 
short of a predictive theory for the response of the Southern Ocean to 
changes in forcing. 


Zonal asymmetry and regional dynamics 
The zonal-average perspective illuminated many aspects of Southern 
Ocean dynamics, including how eddies provide a dynamical con- 
nection between the ACC and the overturning circulation. However, 
the Southern Ocean is not zonally uniform, and these asymmetries 
provide clues to missing physics. For example, although eddies are 
central to Southern Ocean dynamics, the distribution of eddy kinetic 
energy is not uniform, with relatively low levels over flat abyssal plains 
and elevated levels downstream of topography. Recent studies have 
shown how many of the key physical processes relevant to Southern 
Ocean dynamics and climate are focused in ‘hot spots, which are often 
linked to bottom topography. Figure 2 provides a schematic overview 
of the processes at work when the ACC encounters a topographic 
obstacle. 

Upstream of the topography, the flow is largely zonal, eddy kinetic 
energy and eddy fluxes are low, vertical motion and cross-front 
exchange are suppressed, and the ACC fronts are distinct!*. As the ACC 
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Baroclinic and barotropic Baroclinic flows vary with depth; 
barotropic flows are independent of depth. 

Baroclinic instability A baroclinic instability is an instability of an 
atmospheric or oceanic flow that releases potential energy from 

the mean flow and increases the energy of the eddy field. The more 
rapidly the flow varies with depth (or, equivalently, the steeper the 
slope of isopycnals across the current), the more unstable the flow. 
Bottom-form stress Bottom-form stress is a stress that is exerted on 
the sea floor by the flow, proportional to the difference in pressure at 
constant depth on either side of a topographic feature. 
Bottom-pressure torque Bottom-pressure torque is a torque that is 
exerted on the sea floor by the flow, proportional to the difference in 
pressure along topography, at constant depth. 

Buoyancy Buoyancy is added to the ocean by heating or by the 
input of fresh water by precipitation or ice melt; it is removed by 
cooling or by removal of fresh water by evaporation or the formation 
of sea ice. 

Diapycnal and isopycnal Diapycnal refers to the direction 
perpendicular to surfaces of constant density (that is, isopycnals); 
isopycnal processes act along surfaces of constant density. 

Eddy Ocean eddies are deviations from the mean flow, where the 
mean field can be defined by a temporal average (transient eddies) 
or a spatial average (stationary eddies). Eddies mostly transport 
properties along isopycnals. Baroclinic eddies are produced by 
baroclinic instability of the mean flow. 

Ekman The Ekman layer is the surface layer of the ocean that is 
forced directly by the wind. In the Ekman layer, the drag force that 

is exerted by the wind stress is balanced by the pressure-gradient 
force and the Coriolis force. The Ekman transport is at right angles 
and to the left of the wind in the Southern Hemisphere, looking 
down-wind. Horizontal gradients in wind stress result in divergence 
or convergence of Ekman transport and hence upwelling or 
downwelling. 

Forcing Ocean circulation is driven by the stress of the wind blowing 
on the sea surface and by factors that affect the buoyancy of the 
surface ocean (see ‘buoyancy’). 

Geostrophic balance Large-scale flows in the ocean and 
atmosphere are close to geostrophic balance, whereby the pressure- 
gradient force is balanced by the Coriolis force. 

Kelvin wave A Kelvin wave is a low-frequency gravity wave that is 
trapped to a land boundary or to the Equator. 

Meridional and zonal Meridional refers to the north-south direction; 
zonal refers to the east-west direction. 

Planetary waves Planetary waves or Rossby waves are wave 
motions that result from the conservation of potential vorticity 

and the fact that the Coriolis effect varies with latitude. Planetary 
waves have westward phase velocity and transfer information about 
changes in forcing. 

Potential vorticity Vorticity refers to the rotation of a fluid element 
and includes contributions from horizontal gradients of velocity 
(relative vorticity) and from the Earth’s rotation (planetary vorticity). 
Potential vorticity tends to be conserved by oceanographic flows, in 
the absence of dissipation. 

Topography Topography is the bathymetry, or varying depth, of the 
sea floor. 

Tracer A tracer is a generic property (namely, temperature or 
salinity) that is transported by ocean currents and mixing processes. 


encounters topography, stretching or squashing of the water column 
generates vorticity that is balanced by meridional motion (a conse- 
quence of the tendency of the flow to conserve angular momentum). 
Where the flow crosses isobaths from deep to shallow, bottom-pressure 
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torque turns the flow equatorward and drives upwelling; where the 
deep flow crosses from shallow to deep, currents are turned poleward 
and associated with downwelling®'!*°. (Non-zero bottom-pressure 
torque implies a pressure difference along isobaths and hence a geo- 
strophic flow towards or away from the boundary, which must be 
balanced by upslope or downslope flow’.) Pressure differences across 
topographic obstacles create stresses that, in the circumpolar integral, 
balance the wind stress at the sea surface’*. Similarly, bottom-pressure 
torques balance the vorticity supplied by the curl of the wind stress, 
when integrated around the Southern Ocean. But the zonally integrated 
balances conceal the highly non-uniform distribution of bottom-form 
stress and bottom-pressure torque, which are concentrated where the 
ACC interacts with topography”. Eddy vorticity fluxes exert torques 
that turn the jets and help the ACC to navigate complex topography”*. 
Topographic steering often causes jets to converge near topography, 
steepening isopycnals”’. Although steeper isopycnals are more prone 
to baroclinic instability, sloping bathymetry provides a stabilizing influ- 
ence because of the tendency to conserve potential vorticity and hence 
flow along, rather than across, bathymetric contours. Where deep gaps 
provide a pathway through ridges or other bathymetric obstacles, the 
ACC jets converge to pass through the gap, with eddies accelerating the 
deep flow (that is, the flow becomes more barotropic)**?". 

Downstream of the topography, the flow is no longer stabilized by 
topographic slopes and the potential energy stored in the topographi- 
cally steepened isopycnals is released as a result of baroclinic instabil- 
ity”. Deviations from purely zonal flow by topographic steering help 
to destabilize the flow”?*”. Eddy growth rates are high immediately 
downstream of the topography, whereas eddy kinetic energy reaches a 
maximum further downstream, after the eddies have had time to grow? 2 
Meandering is often pronounced in the lee of topography and jets may 
merge and split”. Downstream of the ridge, advection and stirring by 
eddies are enhanced** and deep barotropic eddies facilitate cross-front 
exchange by increasing the angle between deep and shallow flows”. 

Localized dynamics and deviations from zonal symmetry are also 
important for the overturning circulation. Whereas wind-driven 
upwelling into the surface mixed layer occurs over broad areas, model 
studies suggest that the poleward and upward motion of deep water 
in the ocean interior is focused in narrow regions where eddies and 
bottom topography facilitate vertical motion*®*”. The circulation 
of deep water in the Southern Ocean interior therefore follows an 
upward spiral with mostly zonal flow along isopycnals and along depth 
horizons in regions of smooth topography, connected by rising and 
poleward flow where the ACC interacts with topographic obstacles 
and generates enhanced eddy fluxes. Upwelling of deep water occurs 
mainly along isopycnals; although weak diapycnal mixing over broad 
scales contributes to changes in density, most of the water-mass 
transformation occurs in the surface layer, where air-sea forcing and 
diapycnal mixing are both strong*®. Vertical motion is also associated 
with the bottom-pressure torque that is created when the deep flow 
crosses isobaths’. 

The subduction of mode and intermediate water by the upper cell of 
the Southern Ocean overturning circulation is also focused in local hot 
spots**. Subduction is large where horizontal flow crosses the sloping 
base of the surface mixed layer (a process known as lateral induction). 
Because the flow is steered by topography, and the spatial distribution 
of mixed layer depth is influenced by the large-scale circulation, the 
distribution of subduction is also influenced by bathymetry. The sub- 
duction of anthropogenic carbon into the interior is similarly focused 
in local hot spots”, including standing meanders of the ACC“". 

Eddies also mix and stir tracers, primarily along isopycnals. It has 
been traditional in ocean modelling to assume mixing coefficients 
that are constant in time and space. Advances in theory, motivated by 
measurements of turbulence and tracer dispersion, have revealed both 
spatial and temporal variability in the strength of isopycnal mixing**” 
(Fig. 3). Stirring along isopycnals is suppressed by one to two orders 
of magnitude in the core of the ACC jets, where the flow is sufficiently 
strong to carry tracers downstream before eddies have a chance to mix 
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Fig. 2 | Interaction of the ACC with topography. The schematic 
illustrates the dynamical processes at work in the distinct dynamical 


cross-stream**3, At depth, where the flow is weaker, the jets no longer 
inhibit stirring along isopycnals. 

Microstructure measurements and tracer dispersion experiments 
have also revealed how the strength of diapycnal mixing varies in 
the Southern Ocean**“*, with the Diapycnal and Isopycnal Mixing 
Experiment in the Southern Ocean (DIMES) being the most notable 
example. Dissipation is enhanced in the upper 1,000 m by downward 
propagation and breaking of near-inertial waves generated by strong 
wind forcing*”*8. Diapycnal mixing is also enhanced above rough 
topography, where internal lee waves generated by the interaction of the 
flow with topography propagate upwards and break***?-5!. The large 
diapycnal mixing rates at depth in the Southern Ocean help to drive 
the deep overturning cell and exchange between deep layers**°*. Deep 
diapycnal mixing in the Indian and Pacific oceans also has a critical role 
in the deep overturning cell: abyssal waters exported to the northern 
basins are converted by diapycnal mixing to slightly less-dense deep 
waters that return to the Southern Ocean and feed the upwelling limb 
of the overturning circulation***. In other words, the global overturn- 
ing cell associated with sinking of dense water in the North Atlantic is 
closed first by deep mixing in the deep Indian and Pacific oceans and 
then by upwelling and water-mass transformations in the surface layer 
in the Southern Ocean. In the ocean interior, away from the sea surface 
and from rough topography, diapycnal mixing in the Southern Ocean 
is weak, as found in the rest of the global ocean*®. 


Southern Ocean uptake of heat and carbon 

The strength of the Southern Ocean overturning circulation regulates 
the exchange of heat, carbon dioxide and other properties between the 
deep ocean and the surface layer and therefore has broad implications 
for climate and biogeochemical cycles. For example, the sinking of 
surface waters in the descending branches of the two overturning cells 
carries oxygen-rich water to ventilate the ocean interior. The upwelling 
limb of the overturning cells transfers nutrients from the deep ocean 
to the surface ocean, balancing the downward flux of nutrients and 
carbon via the sinking of organic matter; models suggest that upwelling 
and export of nutrients by the upper overturning cell support up to 
three-quarters of global marine primary production north of 30° S***°, 
Likewise, the Southern Ocean influences atmospheric carbon dioxide 
levels on glacial—interglacial timescales: stronger upwelling of deep 
water vents more natural carbon to the atmosphere, warming the 
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regimes upstream, over (‘topographic’) and downstream of a topographic 
obstacle (represented by the shaded contours) in the path of the current. 


climate during interglacial periods, whereas weaker upwelling results 
in more carbon being trapped in the deep ocean, cooling the climate 
during glacial periods™. 

Of particular relevance to future climate is the role of the Southern 
Ocean overturning in the carbon and heat budget of the ocean. The 
ocean south of 40° S dominates the global ocean uptake of anthro- 
pogenic heat and carbon dioxide®*°”>8, As the atmosphere warms 
and heats the ocean, the northward Ekman transport exports heat to 
the north, delaying warming south of the ACC and enhancing warm- 
ing north of the ACC, where subduction of intermediate and mode 
waters carries heat into the ocean interior*’. Over the past decade, the 
Southern Hemisphere has made the dominant contribution to the 
increase in global ocean heat content, reflecting both the transfer of 
heat to mode and intermediate waters by the overturning circulation 
and the deepening and spin-up of the subtropical gyres®°*!. The upper 
cell of the overturning takes up and exports anthropogenic carbon 
dioxide in a similar manner”*’. For example, in a coupled climate- 
carbon model, the ocean south of 30° S (which represents 30% of the 
surface area of the global ocean) accounts for 75% + 22% of anthro- 
pogenic heat uptake and 43% + 3% of anthropogenic carbon dioxide 
uptake by the global ocean over the historical period’. The net exchange 
of carbon between the atmosphere and the Southern Ocean depends 
on two competing effects, both of which are influenced strongly by 
the overturning circulation: the outgassing of natural carbon driven 
by upwelling of carbon-rich deep water, and the uptake, transport and 
storage of anthropogenic carbon. 

Given the prominent role of the Southern Ocean in the exchange 
of carbon between the atmosphere and the ocean, changes in the 
ability of the region to take up carbon dioxide would have substan- 
tial consequences for the global carbon cycle and climate. A decade 
ago, ocean models and atmospheric inversions suggested that the 
Southern Ocean carbon sink was ‘saturated’ and no longer keeping 
pace with increases in atmospheric carbon dioxide™®. The reduction 
in the strength of the Southern Ocean carbon sink was attributed to 
increased outgassing of natural carbon associated with a wind-driven 
strengthening of the overturning circulation. This raised concerns 
that further weakening of the Southern Ocean carbon sink could con- 
tribute a positive feedback to climate change. At that time, there were 
insufficient ocean carbon data to estimate changes in ocean carbon 
uptake directly. 
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Fig. 3 | Spatial distribution of mixing along and across isopycnals. 
The schematic illustrates the spatial variability of mixing processes in 
the Southern Ocean. Isopycnal mixing (double-headed blue arrows) 

is inhibited in the upper part of the ACC jets (the Polar Front and 
Subantarctic Front; hatched region and contours indicate flow speed), 
where the mean flow is strong relative to the eddies (hatched region 
and contours, indicating flow speed). The strength of isopycnal mixing 
depends on eddy kinetic energy and hence wind strength. Diapycnal 
mixing (squiggly red arrows) is enhanced near the surface, where 
wind-generated, downward-propagating, inertial waves break, and near 


Longer and more complete time series of the partial pressure of 
carbon dioxide and new approaches to data analysis and mapping 
have revealed unanticipated variability in the Southern Ocean carbon 
sink®*®, As found in the earlier studies, the ocean carbon sink south of 
35° S weakened in the 1990s, but strengthened again between 2002 and 
2011 by 0.6 petagrams of carbon per year, or half the magnitude of the 
global trend in the ocean carbon sink over this period™. The ‘reinvig- 
oration of the Southern Ocean carbon sink was attributed to changes 
in both the wind-driven overturning circulation®+® and the sea 
surface temperature (and hence the solubility of carbon dioxide), with 
colder temperatures dominating in the Pacific and weaker upwelling 
dominating in the Atlantic®. Whereas earlier studies emphasized the 
impact of changes in zonally averaged westerly winds on the Southern 
Ocean carbon sink, the more recent works show that the sink is also 
sensitive to regional wind anomalies. The large magnitude of temporal 
changes in the carbon sink underscores the importance of the Southern 
Ocean to global budgets and the sensitivity of this sink to temporal and 
regional variations in climate forcing. 


Sea-ice conveyor of fresh water 

The existence of an overturning circulation in the Southern Ocean 
requires buoyancy forcing to transform water from one density class 
to another. More specifically, the conversion of upwelled deep water to 
lighter mode and intermediate waters requires an input of buoyancy 
through heating or addition of fresh water, whereas conversion to dense 
bottom water requires cooling or addition of salt during sea-ice forma- 
tion. Most previous work has focused on buoyancy forcing by air-sea 
heat exchange and fresh water added by an excess of precipitation over 
evaporation®°°, However, improved estimates of sea-ice transport from 
observations and models indicate that ‘distillation’ of fresh water by sea- 
ice formation, transport and melt constitutes the dominant contribu- 
tion to the buoyancy that is needed to close the upper cell®”-”". Sea-ice 
formation rates are highest at high latitude, on the Antarctic continental 
shelf, where brine release contributes to the formation of dense shelf 
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topography, where lee waves generated by the interaction of the flow with 
topography propagate upwards and break. Diapycnal mixing is weak in 
the remainder of the domain. The dotted lines indicate contours of neutral 
density The right panel shows the distribution of cross-isopycnal mixing 
(dissipation of temperature variance) with depth. The major water masses 
are labelled (AABW, Antarctic Bottom Water; LCDW, Lower Circumpolar 
Deep Water; UCDW, Upper Circumpolar Deep Water; AAIW, Antarctic 
Intermediate Water; SAMW, Subantarctic Mode Water). The right panel 
shows that cross-isopycnal mixing (dissipation of temperature variance) is 
typically enhanced in the upper ocean and within 1-2 km of the sea floor. 


water. Once formed, sea ice is exported to the north by Ekman trans- 
port and ocean gyres and melts. This results in a sink of fresh water over 
the continental shelf and a source of fresh water at lower latitudes. In 
this way, sea ice contributes to both the lower cell, where brine released 
during sea-ice formation contributes to the production of AABW, and 
the upper cell, where fresh water from sea-ice melt helps to convert 
deep water to lighter mode and intermediate water. 

Because the strength of the overturning circulation is set by the 
buoyancy forcing*!’, changes in sea-ice formation and export may 
drive changes in the overturning circulation. Large regional trends in 
Antarctic sea-ice extent and persistence have been observed in recent 
decades®”!"’2, with the retreat of ice in the eastern Pacific and growth 
of ice in the Ross Sea sector attributed to zonal asymmetries in wind 
forcing” |; however, the response of the overturning circulation to the 
resulting regional anomalies in buoyancy forcing has not been investi- 
gated. Changes in fresh-water input from increases in high-latitude pre- 
cipitation as the atmosphere warms”? or from increased glacial melt”*”> 
would also alter buoyancy forcing and water-mass formation in the 
Southern Ocean, with implications for the overturning circulation. 
The overturning circulation may also affect the distribution of sea ice, 
with the sign of the response depending on the timescale considered”®. 
The instantaneous response to an increase in the westerly winds is the 
stronger northward Ekman transport of cold water and the expansion 
of sea ice. But continued strong upwelling eventually brings the deeper 
warm water to the surface, driving warming and sea-ice retreat. The 
short-timescale effect may help to explain the overall expansion of sea 
ice in recent decades, but cannot explain the zonal asymmetry between 
the sea-ice changes observed in the eastern and western Pacific. 


Ocean influence on the Antarctic Ice Sheet 

Floating ice shelves buttress the Antarctic Ice Sheet by providing a back 
stress that resists the flow of glacial ice to the sea””. Thinning or retreat 
of ice shelves may therefore reduce the buttressing effect and cause 
increased export of ice and a rise in sea level’*. Melt of the ice shelf 
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Fig. 4 | Processes that control ocean heat flux to the Antarctic margin. 
The schematic illustrates physical processes that can influence the 
transport of warm water from the open ocean to the base of the floating 
ice shelves. The Antarctic Slope Front (ASF; white hatching and shading 
on the sea surface) forms the boundary between cold waters over the 
continental shelf and warm waters offshore. Processes that modify the 
strength and position of the Antarctic Slope Front (such as tides, wind or 
Kelvin waves) can influence the transport of warm water onto the shelf. 
Wind or buoyancy forcing (local or remote) can change the depth of the 
thermocline on the shelf (dotted white line), affecting how much warm 
water can reach the ice-shelf cavity. Eddies contribute to transferring 


from below by warm waters entering the sub-ice-shelf cavity influences 
the thickness of ice shelves and their buttressing capacity’’. The parts 
of the ice sheet that are grounded below present-day sea level (that 
is, marine-based ice sheets) are particularly sensitive to ocean-driven 
change in the ice shelves at their seaward edge. The marine-based ice 
shelves in the Amundsen and Bellingshausen seas are more exposed 
to ocean heat flux than are other parts of the Antarctic margin because 
the warm waters of the ACC abut the continental shelf in that region. 
The most rapid thinning and mass loss has occurred in the Amundsen 
and Bellingshausen seas, where waters over the continental shelf have 
warmed®. 

However, there is growing evidence that the marine-based portion 
of the East Antarctic Ice Sheet may also be vulnerable to ocean-driven 
melt. Satellite measurements show that parts of the East Antarctic Ice 
Sheet, including the Totten glacier that holds a volume of ice equiv- 
alent to more than 3.5 m of global sea-level rise, have thinned and 
that grounding lines have retreated in recent decades*!. The Totten 
ice shelf experiences basal melt rates that are exceeded only by those 
of ice shelves in the Amundsen Sea*”*3—a surprising result given the 
expectation that this part of the Antarctic margin was more isolated 
from warm ocean waters. But recent oceanographic observations on the 
continental shelf near the Totten ice shelf found that warm water was 
widespread, persisted through the year and reached the sub-ice-shelf 
cavity through a deep channel***°. Recent simulations of the response 
of the ice sheet to future changes in climate suggest that the Aurora and 
Wilkes basins in East Antarctica will make a substantial contribution 
to future sea-level rise, with ice-sheet retreat initiated by ocean heat 
flux8©87, 

To assess the potential vulnerability of the Antarctic Ice Sheet, the 
processes that regulate ocean heat transport to the sub-ice-shelf cavi- 
ties, and their sensitivity to changes in forcing, need to be understood 
(Fig. 4). A key factor is the reservoir of ocean heat available for transfer 
across the shelf break, which is influenced by large-scale circulation 
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warm water across the edge of the continental shelf. Warm water can also 
be steered towards the shelf through deep troughs in the continental shelf. 
Heat loss in coastal polynyas can vent heat from warm waters on the shelf 
before they reach the ice shelves, thereby inhibiting basal melt, or drive 
export of cold, dense shelf waters (blue arrows) that is balanced by onshore 
transport of warm offshore waters (red arrows). The large-scale circulation 
(for example, gyres and the ACC) and changes in deep water properties or 
upwelling strength can influence the reservoir of ocean heat available for 
transport onto the shelf. Changes in wind can affect the upwelling of warm 
deep water and ocean currents so as to either facilitate or inhibit cross- 
shelf exchange of warm water. 


features such as the ACC, the Weddell and Ross gyres and wind-driven 
upwelling. The boundary between warm offshore waters and cooler 
waters over the continental shelf (the Antarctic Slope Front) can move 
or change in strength in response to local and remote forcing®®, altering 
the temperature of water available for transport across the shelf break. 
Warm water present at the shelf break can be transported onto the shelf 
by eddies®?, Kelvin waves* or by currents flowing along bathymetric 
contours in deep troughs on the continental shelf”. Air-sea interaction 
over the continental shelf influences how much of the heat that reaches 
the continental shelf makes it as far as the ice-shelf cavity. Strong ocean 
heat loss in polynyas can vent heat from the ocean before it reaches the 
ice shelf*'. Wind forcing (local or remote) and polynya activity can 
alter the depth of the thermocline and so restrict or enhance the ocean 
heat flux to the cavity”. While ocean heat flux drives the melting of ice 
shelves, the input of glacial meltwater influences ocean circulation and 
sea ice”*+*?, Fresh water supplied by glacial melt increases the stratifica- 
tion of shelf waters, inhibiting deep convection and thereby reducing 
the formation of AABW and further enhancing basal melt by allowing 
ocean heat at depth to reach the ice shelves rather than be lost to the 
atmosphere’”>*. Although progress has been made in identifying the 
processes that regulate ocean heat transport to ice-shelf cavities, it is 
not yet possible to determine the relative importance of these processes, 
now and in the future. 


Past and future change in the Southern Ocean 

Given the effect of Southern Ocean processes on the global ocean cir- 
culation, climate and sea level, changes in the region could have wide- 
spread consequences. For example, changes in the amount of heat and 
carbon dioxide sequestered by the overturning circulation would act as 
a feedback on the rate of climate change. Increased ocean heat transport 
to ice-shelf cavities would drive increased basal melt, reduced buttress- 
ing, loss of mass from the Antarctic Ice Sheet and a rise in sea level*®. 
Despite recent progress, understanding of Southern Ocean dynamics 
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still falls short of a complete theory that enables quantitative predictions 
of future changes in circulation. Nevertheless, recent observations of 
variability and change and advances in physical understanding provide 
some clues to guide a qualitative assessment of how the region will 
respond to changes in forcing. 

Changes in various Southern Ocean properties have been docu- 
mented in recent decades. The upper 2,000 m of the Southern Ocean 
has warmed and freshened'®”**° (such as by more than 0.1°C per 
decade and more than 0.015 practical salinity units per decade at 
depths of 300-500 m over the past four decades'*), with the largest 
changes in ocean heat content on the northern side of the ACC. Both 
air-sea exchange and shifts in the position of the ACC have probably 
contributed to the changes in water properties”®. Waters near the sea 
floor on the continental shelf of the Amundsen and Bellingshausen 
seas have warmed®”, Eddy kinetic energy in the ACC has increased 
between the 1990s and the present”’. The inventory of anthropogenic 
carbon has increased, with the largest changes in intermediate and 
mode waters north of the ACC®**. Changes in chlorofluorocarbons 
between the 1990s and early 2000s have been interpreted as evidence 
for stronger upwelling of poorly ventilated deep water and stronger sub- 
duction of well-ventilated intermediate waters’®, whereas more recent 
work suggests a reduction in overturning in the most recent decade. 
Widespread freshening, warming and contraction of AABW has been 
observed over the past 30-50 years?-', 

What do these recent changes tell us about the sensitivity of the 
Southern Ocean to changes in forcing? Many of the changes summa- 
rized above are consistent with a spin-up of the wind-driven overturn- 
ing cell. The westerly winds shifted south and strengthened between the 
1980s and early 2000s, associated with a positive trend in the Southern 
Annular Mode, the dominant mode of variability of the Southern 
Hemisphere atmosphere’. Changes in wind forcing over the Southern 
Ocean have been linked to loss of ozone'™, greenhouse gas forcing! 
and teleconnections to the tropics!°°. The changes in the ocean inven- 
tory of heat, fresh water and dissolved gases observed in recent decades 
are generally consistent with a strengthening of the wind-driven over- 
turning, supporting the hypothesis that the eddy-driven circulation only 
partially compensates wind-driven changes in overturning’””. Evidence 
from observations!” and model studies””-** suggest that the ACC is 
close to the eddy saturation limit; hence, increases in wind forcing 
drive an increase in eddy kinetic energy, but little change in transport. 
Observations that demonstrate that the baroclinic transport of the ACC 
has remained roughly constant as wind forcing has strengthened”, 
while eddy kinetic energy has increased”’, support this hypothesis. 

How will the Southern Ocean change in the future? The west- 
erly winds are expected to continue to strengthen and shift south 
in response to greenhouse gas forcing’. As the climate warms, we 
can anticipate an increase in heat input to the ocean, an increase in 
fresh-water input from enhanced precipitation”? and ice melt® (includ- 
ing sea ice, icebergs and glacial melt), and hence an increase in the 
input of buoyancy to the ocean. The response of the Southern Ocean 
to these projected changes in forcing can be assessed by considering 
the contribution of different terms in the tracer balances. 

Models and observations suggest that the first-order response to 
these changes in forcing will be passive advection of climate anom- 
alies by the mean flow (that is, VAT, where V is the unperturbed, 
three-dimensional, mean flow and AT is the anomaly in tracer con- 
centration)°”!®, As the climate changes, the surface ocean is warmed, 
freshened and enriched in anthropogenic carbon dioxide by exchange 
with the atmosphere. These anomalies are swept north by the upper 
cell of the overturning circulation, increasing the inventory of anthro- 
pogenic heat, fresh water and carbon dioxide north of the ACC. The 
climate anomalies enter the interior ocean through localized subduc- 
tion hot spots*™; that is, anomalies in heat and other properties are 
harvested over broad spatial scales that are set by the wind and pumped 
into the interior through narrow windows whose distribution reflects 
interaction of the mean flow with the topography of the sea floor and 
of the mixed layer. 
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On the basis of the preceding discussion of Southern Ocean dynam- 
ics, we can anticipate that climate change will also alter the circulation, 
contributing to changes in water properties and inventories (AVThye, 
where AV is the climate-driven anomaly in circulation and Taye is the 
mean tracer concentration). Stronger winds will drive a stronger wind- 
driven cell, which will be partially compensated by a more energetic 
eddy-driven cell*!. Warming and increased fresh-water input will 
increase the buoyancy input to the ocean, driving the stronger water- 
mass transformations required by a spin-up of the upper cell of the 
overturning circulation. The buoyancy transport by the overturning 
circulation must increase to balance the enhanced buoyancy input, 
through an increase in the strength of the overturning circulation*!” 
or in the density difference between the upper and lower branches of 
the overturning (as illustrated in an idealized model’). An increase 
in strength of the upper cell will act on mean isopycnal and diapycnal 
gradients to transport heat and other climate properties. Because eddy 
diffusivity is a function of eddy kinetic energy, an increase in eddy 
energy as a consequence of increased wind forcing will strengthen eddy 
transport along isopycnals’”’. A stronger upper cell will drive changes 
in the subsurface ocean by increasing the poleward and upward trans- 
port of old, poorly ventilated deep water south of the ACC and the sub- 
duction of young, well-ventilated mode and intermediate waters north 
of the ACC!°. This spin-up of the overturning circulation acts in the 
same sense as passive advection of climate anomalies by the mean flow, 
flushing climate anomalies from south to north across the ACC and 
increasing the inventory in subducted mode and intermediate waters. 
Therefore, we expect warming, freshening and increased storage of 
anthropogenic carbon dioxide north of the ACC and smaller changes 
in inventories south of the ACC. 

The future of the Southern Ocean carbon sink is difficult to assess: 
stronger upwelling will mean more outgassing of natural carbon dioxide, 
whereas a stronger upper cell will take up and store more anthro- 
pogenic carbon dioxide?”***. Whether the net Southern Ocean carbon 
sink increases or decreases depends on the extent to which eddies com- 
pensate changes in the wind-driven overturning, and the response of 
the eddy-driven cell remains uncertain. Changes in surface temperature 
in response to regional and larger-scale wind anomalies will also influ- 
ence the strength of the carbon sink™. Stronger upwelling of relatively 
warm deep water may also increase the ocean heat transport to the base 
of floating ice shelves; but, as outlined above, the delivery of heat to the 
ice-shelf cavities depends on many factors which themselves are likely 
to be affected by changes in climate forcing. 

Changes in surface forcing will drive further changes in the Southern 
Ocean that will have consequences for climate. Warming and fresh- 
ening as a result of air-sea exchange and ice melt will enhance strat- 
ification in the upper ocean, inhibiting exchange between the mixed 
layer and the interior®, weakening deep convection” and reducing the 
density of shelf waters that contribute to AABW formation”. We might 
therefore anticipate a weakening of the lower cell, although changes in 
recent decades suggest lighter bottom waters will continue to venti- 
late the abyssal ocean! until a threshold is reached when winter shelf 
waters are too light to sink to the abyssal ocean. Increased winds may 
drive increased diapycnal mixing at depth over rough topography“®, 
increasing a ‘short circuit’ in the abyssal ocean that would reduce the 
amount of well-ventilated bottom water that reaches the basins to the 
north, further reducing the influence of the lower cell. 

A consistent theme of this Review has been the importance of local 
and regional dynamics, which are often linked to topography. Although 
the topography does not change with time, and we can therefore antic- 
ipate that the same topographic features will continue to localize the 
dynamics of the Southern Ocean circulation, changes in the path of 
the current will affect dynamical balances. For example, small changes 
in the angle at which the flow intersects topography can change the 
torque that is exerted by the flow on the sea floor”’. Likewise, changes 
in the path of the ACC will alter pressure differences across topogra- 
phy, changing the stress that is exerted by the flow on the sea floor”®. 
Standing meanders in the lee of topography are probably of particular 
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importance, where stronger wind forcing drives an increase in mean- 
dering, increased instability, more eddy activity, enhanced downward 
transfer of momentum and acceleration of the deep flow; interaction 
of the deep flow with topography establishes bottom-form stress and 
bottom-pressure torque to balance the forcing*”. Analysis of the time- 
dependent momentum balance of the ACC suggests that the adjustment 
to a change in wind involves rapid barotropic processes that enable 
a nearly instantaneous response of bottom-form stress to changes in 
wind forcing”®, 

The above discussion of the future of the Southern Ocean is informed 
by recent progress in dynamical understanding, but remains specu- 
lative, reflecting both gaps in the theoretical underpinning of the 
Southern Ocean circulation and uncertainties in future climate forcing. 
The most substantial gap in physical understanding is the response of 
the eddy field to changes in forcing. This Review has highlighted how 
eddies—transient and stationary—are intimately involved in almost 
every aspect of Southern Ocean dynamics, helping to set the strength 
and vertical and horizontal structure of the mean flow (the ACC and 
the overturning), to drive meridional flow, to navigate complex topo- 
graphy, to shape the complex temporal and spatial distribution of ocean 
mixing, and to transport energy, momentum, vorticity and tracers. 


Outlook and open questions 

Substantial progress has been made in recent years in understanding 
the dynamics and global influence of the Southern Ocean, under- 
pinned by a revolution in ocean observing and advances in theory and 
numerical simulation. These discoveries have shown that the Southern 
Ocean needs to be viewed through both wide-angle and close-up lenses. 
Southern Ocean processes have a disproportionate effect on global 
climate, biogeochemical cycles and sea level, and are linked to low 
latitudes through diverse teleconnections that involve interactions 
between the atmosphere, ocean and cryosphere. On the other hand, 
these global effects are the expression of dynamics that play out largely 
on local and regional scales, often mediated by topography. 

Many questions remain open. Perhaps of greatest importance to cli- 
mate and sea level is the uncertain response of the Southern Ocean 
overturning circulation—in particular, the eddy-driven contribution— 
to changes in wind and buoyancy forcing. Eddies are now known to 
transfer momentum and vorticity from the sea surface to the sea floor, 
but the detailed pathways and their sensitivity to changes in forcing 
are unknown. Recent studies have highlighted how the upwelling and 
downwelling limbs of the overturning circulation are localized by topog- 
raphy, as is cross-front exchange, but the three-dimensional structure of 
the overturning remains obscure. Knowledge of the nature and causes 
of variability in the Southern Ocean is rudimentary, including the rel- 
ative contributions of local forcing, teleconnections to low latitude and 
intrinsic variability. The theoretical foundation for Southern Ocean 
dynamics is developing rapidly, but remains incomplete. This gap is 
reflected, for example, by the speculative nature of the above discussion 
of the response of the Southern Ocean to changes in forcing and by our 
inability to do much more than list the mechanisms that influence the 
delivery of ocean heat to the Antarctic margin. The fact that new obser- 
vations continue to reveal surprises that challenge existing thinking also 
underscores gaps in knowledge; examples include the unanticipated var- 
iability of the Southern Ocean carbon sink, the dominant contribution 
of the Southern Hemisphere to the change in global ocean heat content 
in the past decade, and evidence that the East Antarctic Ice Sheet is more 
exposed to ocean heat transport than once thought. 

The prospects for further progress are encouraging. The Argo array 
of profiling floats has allowed year-round, broad-scale observations 
of the data-sparse Southern Ocean for the past decade, and many of 
the recent insights summarized in this Review have relied on these 
new measurements. As the profiling float array expands into the ice- 
covered and deep ocean, further breakthroughs will follow. The capability 
of other observing tools such as gliders and autonomous underwater 
vehicles is also increasing, making measurements feasible in previously 
inaccessible areas such as beneath floating ice shelves and perennial 
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ice. Continued growth in computing power is allowing numerical 
exploration of Southern Ocean dynamics with high-resolution models 
that resolve critical physical processes that act at small scales. Insights 
gained in this way promise to improve the parameterizations used in 
Earth system models, helping to reduce biases in their representation of 
the Southern Ocean. The combination of new observations, advances in 
theory and improvements in modelling promises to deliver a much bet- 
ter understanding of how the Southern Ocean circulation will respond 
to future change and how it will influence global climate, biogeochem- 
ical cycles and sea-level rise. 
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Mass balance of the Antarctic Ice Sheet 


from 1992 to 2017 


The IMBIE team* 


The Antarctic Ice Sheet is an important indicator of climate change and driver of sea-level rise. Here we combine satellite 
observations of its changing volume, flow and gravitational attraction with modelling of its surface mass balance to show 
that it lost 2,720 £1,390 billion tonnes of ice between 1992 and 2017, which corresponds to an increase in mean sea level 
of 7.6 +3.9 millimetres (errors are one standard deviation). Over this period, ocean-driven melting has caused rates of 
ice loss from West Antarctica to increase from 53 + 29 billion to 159 + 26 billion tonnes per year; ice-shelf collapse has 
increased the rate of ice loss from the Antarctic Peninsula from 7 + 13 billion to 33 + 16 billion tonnes per year. We find 
large variations in and among model estimates of surface mass balance and glacial isostatic adjustment for East Antarctica, 
with its average rate of mass gain over the period 1992-2017 (5 + 46 billion tonnes per year) being the least certain. 


level by 58 m!. They channel ice to the oceans through a network 

of glaciers and ice streams’, each with a substantial inland catch- 
ment’. Fluctuations in the mass of grounded ice sheets arise owing to 
differences between net snow accumulation at the surface, meltwater 
runoff and ice discharge into the ocean. In recent decades, reductions in 
the thickness* and extent? of floating ice shelves have disturbed inland 
ice flow, triggering retreat®’, acceleration®? and drawdown!”"! of many 
marine-terminating ice streams. Various techniques have been developed 
to measure changes in ice-sheet mass, based on satellite observations 
of their speed'”, volume!’ and gravitational attraction'+ combined with 
modelled surface mass balance (SMB)’* and glacial isostatic adjustment 
(GIA; the ongoing movement of land associated with changes in ice load- 
ing)!°. Since 1989, there have been more than 150 assessments of ice 
loss from Antarctica based on these approaches?”. An inter-comparison 
of 12 such estimates!® demonstrated that the three principal satellite 
techniques provide similar results at the continental scale and, when 
combined, lead to an estimated mass loss of 71 + 53 billion tonnes of 
ice per year (Gt yr~') averaged over the period 1992-2011 (errors are 
one standard deviation unless stated otherwise). Here, we extend this 
assessment to include twice as many studies, doubling the overlap period 
and extending the record to 2017. 


r lhe ice sheets of Antarctica hold enough water to raise global sea 


Satellite observations 

We collated 24 independently derived estimates of ice-sheet mass bal- 
ance (Fig. 1) that were determined within the period 1992-2017 and 
based on the techniques of satellite altimetry (seven estimates), 
gravimetry (15 estimates) or the input-output method (two esti- 
mates). Altogether, 24, 24 and 23 individual estimates of mass change 
were computed within defined geographical limits*’? for the East 
Antarctic Ice Sheet (EAIS), West Antarctic Ice Sheet (WAIS) and 
Antarctic Peninsula Ice Sheet (APIS), respectively. We compared the 
rates of ice-sheet mass change (see Methods) over common intervals 
of time!®. We then averaged the rates of ice-sheet mass balance using 
the same class of satellite observations to produce three technique- 
dependent time series of mass change in each geographical region 
(see Methods). Within each class, we computed the uncertainty in 
the annual mass rate as the mean uncertainty of the individual 


contributions. The final, reconciled estimate of ice-sheet mass change 
for each region was computed as the mean of the technique- 
dependent values available at each epoch (Fig. 1). In computing the 
associated uncertainty, we assume that the errors for each technique 
are independent. To estimate the cumulative mass change and its 
uncertainty (Fig. 2), we integrated the reconciled estimates for each 
ice sheet and weighted the annual uncertainty by 1 /-/7, where n is the 
number of years since the start of each time series. We computed 
Antarctic Ice Sheet (AIS) mass trends as the linear sum of the regional 
trends and the uncertainties in the mass trends as the root-sum-square 
of the regional uncertainties (Table 1). 


Trends in Antarctic ice-sheet mass 

The level of disagreement between individual estimates of ice-sheet mass 
balance increases with the area of each ice-sheet region, with average 
per-epoch standard deviations of 11 Gt yr~!, 21 Gt yr! and 37 Gt yr! 
at the APIS, the WAIS and the EAIS, respectively (Fig. 1, Methods). 
Among the techniques, gravimetric estimates are the most abundant 
and also the most closely aligned, although their spread increases in East 
Antarctica, where GIA remains poorly constrained” and is least certain 
when spatially integrated”!-*”, owing to the vast extent of the region. 
Solutions based on satellite altimetry and the input-output method run 
for the entire record, roughly twice the duration of the gravimetry time 
series. Although most (59%) estimates are within one standard deviation 
of the technique-dependent mean, a few (6%) depart by more than three 
standard deviations. At the Antarctic Peninsula, the 25-year average 
rate of ice-sheet mass balance is —20 + 15 Gt yr~', with an increase of 
about 15 Gt yr in losses since 2000. The strongest signal and trend has 
occurred in West Antarctica, where rates of mass loss increased from 
53 + 29 Gtyr7! to 159 + 26 Gt yr“ between the first and final five years 
of our survey; the largest increase occurred during the late 2000s when 
ice discharge from the Amundsen Sea sector accelerated**. Both of these 
regional losses are driven by reductions in the thickness and extent of 
floating ice shelves, which has triggered the retreat, acceleration and 
drawdown of marine-terminating glaciers**. The least certain result is in 
East Antarctica, where the average 25-year mass trend is 5 + 46 Gt yr7’. 
Overall, the AIS lost 2,720 + 1,390 Gt of ice between 1992 and 2017, an 
average rate of 109 + 56 Gt yr7?. 


*A list of authors and their affiliations appears at the end of the paper. 
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Fig. 1 | Antarctic Ice Sheet mass balance. a—c, Rate of mass change 
(dM/dt) of the APIS (a), WAIS (b) and EAIS (c), as determined from the 
various satellite-altimetry (purple), input-output-method (blue) and 
gravimetry (green) assessments included in this study. In each case, 
dM/dt is computed from time series of relative mass change using a 
three-year window at annual intervals. An average of estimates across 
each class of measurement technique is also shown for each year (black). 
The estimated 1a, 20 and 3c ranges of the class averages are shaded in 
dark, mid and light grey, respectively; the number of individual 
mass-balance estimates collated at each epoch is shown below. 


Surface mass balance 

Knowledge of the ice-sheet SMB is an essential component of the 
input-output method, which subtracts solid-ice discharge from net 
snow accumulation, and aids interpretation of mass trends derived 
from satellite altimetry and gravimetry. Snowfall is the main driver 
of temporal and spatial variability in AIS surface mass change*>*®. 
Although locally important, spatially integrated sublimation and melt- 
water runoff are typically one and two orders of magnitude smaller, 
respectively. In the absence of observation-based maps, AIS SMB is 
usually taken from atmospheric models, evaluated with in situ and 
remotely sensed observations!>?”~*°. To assess Antarctic SMB, we com- 
pared two global reanalysis products (JRA55 and ERA-Interim) and 
two regional climate models (RACMO2 and MARv3.6) (see Methods). 
ERA-Interim is usually regarded as the best-performing reanalysis 
product over Antarctica, albeit with a dry bias in the interior and 
overestimated rain fraction*?*!”. Spatially averaged accumulation 
rates peak at the Antarctic Peninsula, and are roughly three and seven 
times lower in West and East Antarctica, respectively (Extended Data 
Figs. 2, 3). Compared to the all-model average SMB of 1,994 Gt yr7}, 
the regional climate models give values 4.7% higher and the reanalyses 
7% lower. These differences can be attributed to the higher resolution 
of the regional models, which resolve the steep coastal precipitation 
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Fig. 2 | Cumulative Antarctic Ice Sheet mass change. The cumulative 
ice-sheet mass changes (solid lines) are determined from the integral of 
monthly measurement-class averages (for example, the black lines in Fig. 1) 
for each ice sheet. The estimated 1o uncertainty of the cumulative change 
is shaded. The dashed lines show the results of a previous assessment. 


gradients in greater detail, and to their improved representation of polar 
processes. The temporal variability of all products is similar and they 
all agree on the absence of an ice-sheet-wide trend in SMB over the 
period 1979-2017, which implies that recent mass loss from the AIS is 
dominated by increased solid-ice discharge into the ocean. 


Glacial isostatic adjustment 

Gravimetric estimates of mass change are strongly influenced by the 
method used to correct for GIA**®. In this study, six different GIA 
models were used?!463031,43, We also assessed nine continent-wide 
forward-model simulations and two regional model simulations to 
better understand uncertainties in the GIA signal; we reprocessed the 
gravimetry estimates of mass balance using the W12a” and IJ05_R2*! 
GIA models for comparison with earlier work!® (see Methods). The 
net gravitational effect of GIA across Antarctica is positive, and the 
mean and standard deviation of the continent-wide GIA models 
(54+ 18 Gt yr) is very close to that of the W12a (56 + 27 Gt yr’) 
and IJ05_R2 (55 + 13 Gt yr‘) models. The narrow spread probably 
reflects the difficulty of quantifying the timing and extent of past ice- 
sheet change and the absence of lateral variations in Earth rheology 
within some models“*. Models predict the greatest rates of solid-Earth 
uplift (5-7 mm yr7! on average) in areas where GIA is a substantial 
component of the regional mass change, such as the Amundsen, Ross 
and Filchner—Ronne sectors of West Antarctica (see Extended Data 
Fig. 4), but also the greatest variability (for example, a standard devi- 
ation of more than 10 mm yr~' in the Amundsen sector). Away from 
areas with large GIA signals there is low variance among the models 
and broad agreement with GPS observations*®. Nevertheless, most 
models considered here do not account for ice-sheet change during the 
past few millennia, because it is poorly known. Inaccurate treatment of 
low-degree harmonics associated with the global GIA signal can also 
bias gravimetric mass-balance calculations”. If the GIA signal includes 
a transient component associated with recent ice-sheet change, it will 
bias mass-trend estimates and should be accounted for in future work. 


Outlook 

Improvements in assessments of ice-sheet mass balance are still possi- 
ble. Airborne snow radar*”*’ is a powerful tool for evaluating models of 
SMB and firn compaction over large spatial (thousands of kilometres) 
and temporal (centennial) scales, in addition to the ice cores that have 
traditionally been used”. Geological constraints on the ice-sheet his- 
tory”° and GPS measurements of contemporary uplift enable GIA 
models to be scrutinized and calibrated. More of both of these types of 
datasets are needed, especially in East Antarctica. Given their apparent 
diversity, the spread of models of GIA and SMB should be evaluated in 
concert with the satellite gravimetry, altimetry and velocity measure- 
ments. A reassessment of the satellite measurements acquired during 
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Table 1 | Rates of ice-sheet mass change 


ANALYSIS 


1992-1997 (Gt yr) 


1997-2002 (Gt yr-1) 


2002-2007 (Gt yr-}) 


2007-2012 (Gt yr-}) 


2012-2017 (Gt yr-!) 


1992-2011 (Gt yr!) 


1992-2017 (Gt yr) 


EAIS 11458 8+ 56 12+ 43 23 +38 —28 + 30 13 + 50 5+ 46 
WAIS —53 +29 —41+ 28 —-65+27 —148+ 27 —159 + 26 —73 428 -94+427 
APIS -7+13 —6413 —20+15 —35417 —-33+416 —-16+14 —20+415 
AIS —-49 +67 —38 + 64 -73 +53 —160 + 50 —219+ 43 -76+59 —109 + 56 
Rates were determined from all satellite measurements over various epochs for the EAIS, WAIS and APIS, which combined constitute the AIS. The period 1992-2011 is included for comparison to a 


previous assessment!8, which reported mass-balance estimates of 14 + 43 Gt yr~! for the EAIS, —65 + 26 Gt yr~! for the WAIS, —20 + 14 Gt yr~! for the APIS and —71 + 53 Gt yr“! for the AIS. The 
small differences in our updated estimates for this period are due to our inclusion of more data. Errors are lo. 


the 1990s would address the imbalance that is present in the current 
record. Alternative techniques (see, for example, ref. °') for combin- 
ing satellite datasets should be explored, and satellite measurements 
with common temporal sampling should be contrasted. The ice-sheet 
mass-balance record should now be separated into the contributions 
due to short-term fluctuations in SMB and to longer-term trends in 
glacier ice. In addition to these improvements, continued satellite 
observations are, of course, essential. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
018-0179-y 
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METHODS 


Data. No statistical methods were used to predetermine sample size. 

We analyse five groups of data: mass-balance estimates determined from sat- 
ellite altimetry, gravimetry and the input-output method, and model estimates 
of SMB and GIA. We compute the datasets using common spatial and temporal 
domains to facilitate their aggregation, according to previously reported methods 
(see Supplementary Table 1). In total, 24 mass-balance datasets were included. 
The data include 25 years of satellite-radar-altimeter measurements, 24 years of 
satellite input-output-method measurements and 14 years of satellite-gravimetry 
measurements (Extended Data Fig. 1). Among these data are estimates of ice-sheet 
mass balance for each ice sheet derived from each satellite technique. In compari- 
son to the first IMBIE assessment!®, new satellite missions, updated methodologies 
and improvements in geophysical corrections have contributed to an increase in 
the quantity, duration and overlap period of data used here. In addition, two new 
experiment groups have assessed 11 GIA and 4 SMB models. The complete list of 
datasets is provided in Supplementary Table 1. 

Drainage basins. In this assessment, we analyse mass trends using two sets of 
ice-sheet drainage basins (Extended Data Fig. 2) to ensure consistency with those 
used in the first IMBIE assessment’® and to evaluate an updated definition tailored 
towards assessments using the input-output method. The first drainage-basin set 
was delineated using surface elevation maps derived from ICESat-1 on the basis 
of the provenance of the ice and includes 27 basins®. The second set was updated 
to consider other factors such as the direction of ice flow and includes 18 basins 
in Antarctica”. To assess the effect of the different sets on the estimates of ice- 
sheet mass balance, we compared mass-balance determinations between the two 
delineations of ice-sheet drainage basins. This evaluation was facilitated by seven 
estimates (altimetry or gravimetry) determined using both sets. At the scale of 
the major ice-sheet divisions, the delineations produce similar total extents. By 
far the largest differences occur in the delineation (or definition) of East and West 
Antarctica, owing to differences in the position of the ice divide that separate 
them. Within these regions, the root-mean-square difference between 26 pairs of 
estimates of ice-sheet mass balance computed using the two drainage-basin sets 
is 8.7 Gt yr” !. This difference is small in comparison to the certainty of individual 
assessments of ice-sheet mass balance. 

Computing rates of mass change. The raw satellite mass-balance data are time 
series of either relative mass change AM(t) or the rate of mass change dM(t)/dt, 
plus their associated uncertainty, integrated over at least one of the ice-sheet regions 
defined in the standard drainage-basin sets. In the case of AM(f), the time series 
represents the change in mass through time relative to some nominal reference 
value. The duration and sampling frequency of the time series was not restricted. 
In practice, few mass time series were of AM(t) and dM(t)/dt. Because the 
inter-comparison exercise is based on comparing and aggregating dM(t)/dt, a 
common solution was implemented to derive dM(t)/dt values from datasets of 
AM(t) only. Each AM(t) time series was used to generate a time-varying estimate 
of dM(t)/dt, d[AM(t)]/dt = dM(t)/dt, and an estimate of the associated uncertainty, 
using a consistent approach. Time-varying estimates of dM(t)/dt were computed 
by applying a sliding fixed-period window to the AM(f) time series. At each node, 
defined by the sampling period of the input time series, dM(t)/dt and its standard 
error Cam(pyat Were estimated by fitting a linear trend to data within the window 
using a weighted least-squares approach, with each point weighted by its respective 
error variance ox mi): The regression error cama incorporates measurement 
errors and model structural error due to any variability that deviates from linear 
trends in ice mass, and may be a conservative estimate in locations where such 
deviation is present. Time series of dM(t)/dt computed using this approach were 
truncated by half the moving-average window period. When integrated, the 
dM(t)/dt time series correspond to a low-pass-filtered version of the original AM(t) 
time series. This linear regression assumes that uncertainties are uncorrelated; 
however, the smoothing that we apply during the calculation of the trend causes 
data points to be correlated during several epochs beyond the sliding window. 
Surface mass balance. Ice-sheet SMB comprises various processes governed by the 
interaction of the superficial snow and firn layers with the atmosphere. A direct 
mass exchange occurs via precipitation and surface sublimation. Snow drift and the 
formation of meltwater and its subsequent refreezing or retention redistribute mass 
spatially or lead to further mass loss via erosion and sublimation or via runoff. Here 
we compare a range of SMB products. Four SMB model solutions were considered 
for Antarctica (Extended Data Table 1): two regional models (RACMO2.3* and 
MARv3.6°”) and two global reanalysis products (JRA55°° and ERA-Interim™). The 
two regional climate models agree well in terms of their spatially integrated SMB, 
apart from the Peninsula where there is an offset of about 10 Gt per month between 
them (Extended Data Fig. 3). However, the reanalysis products underestimate 
the average SMB compared to the regional climate models by 200-350 Gt yr~1. 
Our SMB assessment illustrates that products of similar class (climate models or 
reanalysis products) agree well, suggesting that groupings of their output may 
be appropriate. However, we found that model resolution is important when 
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estimating SMB and its components, because contributions that differed by only 
the spatial resolution yielded differences at the regional scale. 

Glacial isostatic adjustment. GIA is the delayed response of the solid Earth to 
changes in time-variable surface loading through the growth and decay of ice 
sheets, and associated changes in sea level. Because GIA contributes to changes 
in the ice-sheet surface elevation and gravity field, it must be accounted for in 
measurements of the change in elevation and gravity for the purpose of isolating 
the contribution solely caused by ice-sheet imbalance. Here we compare differ- 
ent solutions derived from continuum-mechanical forward modelling to inform 
the interpretation of the satellite altimetry and gravimetry data that depend on 
the correction and to advise future assessments. Twelve GIA contributions were 
received that cover Antarctica (Extended Data Table 2), ten of which are global 
models”?! and two of which are regional models*. Because a broad array of 
data may be used to constrain GIA forward models, we anticipate spread in the 
predictions. 

Here we assess the degree of similarity between the various GIA model solu- 
tions. We identified areas of enhanced present-day vertical surface motion and 
(dis-)agreement between contributions by averaging the uplift rates over the contri- 
butions and computing the respective standard deviations (Extended Data Fig. 4). 
In some cases, it was necessary to estimate the GIA contribution to gravimetric 
mass trends; this was done using common geographical masks and truncation and 
a standardized treatment of low-degree harmonics. In Antarctica, the Amundsen 
Sea sector and the regions covered by the Ross and Filchner Ronne ice shelves 
stand out as having both high uplift rates (5-7 mm yr! on average) and high 
variability in uplift rates (peaking at more than 10 mm yr! standard deviation in 
the Amundsen sector) among the models considered. Elsewhere in coastal regions, 
uplift occurs at more moderate rates (about 2 mm yr! on average); the interior 
of East Antarctica exhibits slow subsidence. In these regions, the average signal is 
accompanied by relatively low variance among the GIA models (0-1.5 mm yr™! 
standard deviation). None of the models fully captures portions of the uplift that 
are observed to be very large (see, for example, ref. 55); hence, we anticipate a bias 
towards low values for the GIA correction averaged over such regions. In areas of 
low mantle viscosity, however, such as part of the WAIS, the GIA signal related to 
the Last Glacial Maximum may be overpredicted, and it is not clear whether a bias 
exists at the continental scale. 

Differences between the model predictions arise for various additional reasons. 
Technical differences in the modelling approach, for example, relating to the con- 
sideration of self-gravitation, ocean loading, rotational feedback and compressibil- 
ity, are most important at the global scale, but may explain only small differences 
among the regional models. Differing treatment of ice and ocean loading in regions 
that have experienced marine-based grounding-line retreat during the last gla- 
cial cycle may explain the differences in model predictions for the ICE_6G_C/ 
VM65a combination (see Supplementary Table 1). Some small differences should be 
expected when comparing models that use spherical-harmonic and finite-element 
approaches. Looking beyond consideration of the model physics, larger differ- 
ences arise owing to the various approaches used to determine the two principal 
unknowns associated with forward modelling of GIA—ice history and Earth 
rheology. There is no generally accepted ‘best approach to determining these 
inputs, and useful advances can be made by comparing the results of complemen- 
tary approaches. In the models considered here, approaches to determining the ice 
history include dynamical ice-sheet modelling, coupled ice-sheet-GIA modelling, 
tuning to fit geodetic constraints, tuning to fit geological constraints and use of 
direct observations of historical ice-sheet change. When defining the rheological 
properties of the solid Earth, most studies have opted to use a Maxwell rheology 
to define a radially symmetric Earth; however, the use of a power-law rheology or 
a fully three-dimensional Earth model to capture the spatial complexity of mantle 
properties is increasingly popular. An intermediate approach used in many of 
the datasets included here has been to develop a regional GIA model that reflects 
local Earth structure. Such models can be tuned, albeit imperfectly, to provide as 
accurate a representation of GIA in that region as is possible. However, it remains a 
difficult and important challenge to incorporate these regional studies into a global 
framework. Finally, although four of the GIA models that we consider provide 
a measure of uncertainty, and several studies have used an ensemble modelling 
approach?>”’, an important future goal for the GIA modelling community is the 
inclusion of robust error estimates for all model predictions. 

To compare the GIA models, we used Stokes coefficients that relate to their grav- 
itational signal to determine the approximate magnitude of the effect of applying 
each correction to GRACE data (Extended Data Table 2). This is a preliminary 
assessment, because the effect of applying a GIA correction depends also on the 
methods used to process the GRACE data. Moreover, an agreement on the modelling 
of feedbacks has so far not been reached within the GIA community, leading to a 
large spread in the modelled degree-2 coefficients and possibly a strong bias when 
a correction is applied that is inconsistent with the GRACE observations (up to 
about 40 Gt yr~). In addition, none of the current GIA datasets includes estimates 
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of the GIA-induced geocentre motion (degree-1 coefficients). Therefore, we omit 
degree-1 and -2 coefficients in our assessment of the GIA-induced apparent mass 
change. From models that represent GIA in Antarctic only, we estimate that this 
omission could change the apparent mass-change value by up to 20%; however, 
this potential error is not currently included in the GIA error budget. There is 
relatively good agreement between the ten models that cover all of Antarctica 
(Extended Data Table 2); the estimated GIA contribution ranges from +12 Gt yr} 
to +81 Gt yr” |; the mean value is 56 Gt yr_'. Although the solution from ref. “* is 
a notable outlier, it is the only one to account for three-dimensional variations in 
Earth’s rheology. It will be interesting to compare this result with other such models 
that are under development. 

Two of the GIA models” are regional: although they cannot be compared with 

the continental-scale models directly, the magnitude of their signals is nonetheless 
included for interest. 
Mass-balance intra-comparison. First, we compare estimates of mass change 
within each of the three geodetic-technique experiment groups to assess the 
degree to which results from common techniques concur and to derive individ- 
ual, aggregated estimates of mass change from each technique. In each case, we 
compare estimated rates of mass change derived from a common technique over 
a common geographical region and over the full period of the respective datasets. 
Where datasets were computed using both drainage-basin definitions, we present 
the arithmetic mean of the two estimates. This is justified because the choice of 
drainage-basin set has a very small (less than 10 Gt yr~') effect on estimates of mass 
balance at the ice-sheet scale and even less at the regional scale. Within each exper- 
iment group, we perform an unweighted average of all individual data to obtain 
a single estimate of the rate of mass change per ice sheet for each geodetic tech- 
nique. In a few cases, it was not possible to determine time-varying rates of mass 
change from individual estimates, because only constant rates of mass change and 
constant cumulative mass changes were supplied. Although the effect of averaging 
these datasets with time-varying solutions is to dampen the temporal variability 
present within the series of finer resolution, they are retained for completeness. 
We estimate the uncertainty of the average mass trends that emerge from each 
experiment group as the average of the errors associated with each individual 
estimate at each epoch. 

To aid comparison, we (i) compute time-variable rates of mass change and their 

associated uncertainty over successive 36-month periods stepped in one-month 
intervals from time-varying cumulative mass changes, and (ii) average rates of 
mass change over one-year periods to remove signals associated with seasonal 
cycles. Time-varying rates of mass change are truncated at the start and end of each 
series to reflect the half-width of the time interval over which rates are computed, 
although this period is recovered on integration to cumulative mass changes. The 
extent to which we are able to analyse differences in mass-balance solutions that 
emerge from common satellite approaches is limited by the mismatch in temporal 
resolution of the individual datasets, which makes methodological and sampling 
differences difficult to separate. 
Gravimetry mass-balance intra-comparison. Within the gravimetry experiment 
group, we assessed 15 estimates of mass balance derived from the GRACE sat- 
ellites, in entirety spanning the period July 2002 to September 2016. Of these 
datasets, four?”~°° were derived with direct imposition of the GRACE level-1 
K-band range data. These impositions result in four different, independently 
derived, mascon approaches. Other methods often refer to ‘mascon analysis, 
but are conducted on post-spherical-harmonic expansions and without impos- 
ing the level-1 K-band range data. We distinguish the later methods, referring 
to them as ‘post-spherical-harmonic mascons. Eleven contributions are derived 
from monthly spherical-harmonic solutions of the global gravity field using dif- 
ferent approaches****!~®, which can be loosely classified as (i) region-integra- 
tion approaches*>®, (ii) post-spherical-harmonic mascon approaches**°!-®, 
(iii) forward-modelling approaches*™, which essentially involve modelling of 
mass change with iterative comparison to the GRACE-derived signal, and (iv) 
approaches that use Slepian functions®’. One final estimate®* made use of satellite 
altimetry data; although this estimate was excluded from our gravity ensemble 
average because it is a hybrid solution, it is presented alongside the gravimetry-only 
results for comparison. No restrictions were imposed on the choice of GIA correc- 
tion; among the GRACE solutions we consider six different models?!747630:3143, 
However, we did assess a wider set of nine continent-wide forward models and two 
regional models to better understand uncertainties in the GIA signal. 

In total, there were 15 estimates of mass balance for each of the APIS, WAIS 
and EAIS. All were time-varying, cumulative mass-change solutions—the primary 
GRACE observable—and we computed time-varying rates of mass change from 
these data. Combining all of the individual mass-balance estimates, the effective 
(average) temporal resolution of the aggregated solution is one year. Further details 
of the gravimetry datasets and methods are included in Supplementary Table 1. 

In Extended Data Fig. 5 we show a comparison of the rates of mass change 
obtained from all gravimetry mass-balance solutions, calculated over the three 


main ice-sheet regions. At individual epochs, differences between time-varying 
rates of mass change are generally less than 50 Gt yr! in each ice-sheet region, and 
typically in the range 10-20 Gt yr~'. Over the full period of the data, individual 
rates of mass balance for the APIS, WAIS and EAIS vary between —80 Gt yr~! and 
+10 Gt yr~!, —260 Gt yr! and —20 Gt yr~!, and —120 Gt yr“! and +200 Gt yr}, 
respectively. Considering all of the gravimetry data (Extended Data Table 3), the 
standard deviation of mass trends estimated during the period 2005-2015 is less 
than 24 Gt yr“ in all three ice-sheet regions, with the largest spread occurring 
in the EAIS. In all three ice-sheet regions, the spread of individual mass balance 
estimates is well represented by the mean, considering the uncertainties of the 
individual and aggregated datasets. 
Altimetry mass-balance intra-comparison. We assessed seven radar- and laser- 
altimetry-derived AIS mass-balance datasets, in entirety spanning the period April 
1992 to July 2017. In total, six estimates of mass change were for the APIS, seven 
were for the EAIS and seven were for the WAIS. Of these, four included data from 
radar altimetry and six from laser altimetry. Various techniques were used to derive 
the elevation and mass trends®-”°. Only two of the altimetry datasets were time 
series of cumulative mass change, from which we computed time-varying rates of 
mass change. The remaining altimetry datasets were constant rates of mass change, 
which appear in our altimetry average as time-invariant solutions. The period 
over which altimetry rates of mass change were computed ranged from 2 years to 
24 years. In consequence, the aggregated dataset has a temporal resolution that is 
lower than annual. Including all individual mass-balance datasets, the effective 
(average) temporal resolution of the aggregated solution is 3.3 years. Further details 
of the altimetry datasets and methods are included in Supplementary Table 1. 
With a few exceptions, rates of mass change determined from radar and laser 
altimetry tend to differ by less than 100 Gt yr at all times in each ice-sheet region 
(Extended Data Fig. 5). The main exceptions are in the EAIS, where one estimate” 
reports mass trends that are roughly 100 Gt yr more positive than all others 
during the ERS and ICESat periods, and in the WAIS, where two estimates’)74 
report rates that are about 70 Gt yr! less negative than the others during the 
ICESat period. Among the remaining datasets, the closest agreement occurs at 
the APIS, where mass trends agree to within 30 Gt yr! at all times; the poorest 
agreement occurs at the EAIS, where mass trends depart by up to 100 Gt yr_!. The 
largest differences are between datasets that are constant in time during periods 
where rapid changes in mass balance occur in the annually resolved time series, 
suggesting that a proportion of the difference is due to their poor temporal reso- 
lution. Mass-balance solutions from the relatively short (six-year) ICESat mission 
also appear to show larger spreads compared to those determined from longer 
(decade-scale) radar-altimetry missions. This larger spread is due in part to differ- 
ences in the bias-correction models applied to ICESat data”*’*-’8 and in part to the 
large influence of firn densification on altimetry measurements over short periods, 
which have been corrected for using different models. Firn-densification models 
are generally not applied to mass-balance solutions determined from radar altime- 
try. Further analysis of the corrections for bias between ICESat campaigns and firn 
compaction is required to establish the statistical significance of the differences and 
to reduce their collective uncertainty. Comparing rates of mass change (Extended 
Data Table 3), the average standard deviation of all mass trends at each epoch over 
the common period 2005-2015 is less than 54 Gt yr in all four ice-sheet regions. 
The largest spread between the individual values occurs in the EAIS. Other than 
this sector, the individual estimates lie close to the ensemble average, considering 
the respective uncertainty of the measurements. 
Input-output-method intra-comparison. Although the input-output method is 
the most direct measure of changes in mass fluxes, a key difficulty is that, to assess 
mass balance, it must differentiate between two large numbers—one for annual 
SMB and the other for discharge plus grounding-line migrations—and deal appro- 
priately with the error budgets of both. A consequence of this complexity is that 
few input-output-method datasets exist at the ice-sheet scale. Here we collate just 
two input-output-method datasets, both based on the same method’°—far fewer 
than were considered for altimetry and gravimetry. The first dataset spans the 
period 1992-2010"; the second is limited to the period 2002-2016. The same SMB 
model (RACMO2.3) was used in both assessments. Further details of the input- 
output-method datasets and methods are included in Supplementary Table 1. 
We compare the two datasets during the period 2002-2010 (when the datasets 
overlap; Extended Data Table 3). The smallest differences (up to 30 Gt yr!) arise 
in the APIS and the WAIS; the largest differences (up to 70 Gt yr‘) occur at 
the EAIS. In all cases, the average difference between estimates of mass balance 
derived from each dataset is comparable to the estimated uncertainty. Including 
both datasets, rates of mass balance over the period 1992-2016 for the APIS, WAIS 
and EAIS range from —125 Gt yr! to +25 Gt yr~!, —300 Gt yr~! to +100 Gt yr“! 
and —200 Gt yr! to +200 Gt yr“, respectively (Extended Data Fig. 5). The origin 
of the differences between the two datasets requires further investigation. 
Ice-sheet mass-balance inter-comparison. To assess the degree to which 
the satellite techniques concur, we used the aggregated time series from each 
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geodetic-technique experiment group to compute changes in ice-sheet mass bal- 
ance within common geographical regions and over a common interval of time 
(the overlap period). We calculate the aggregated time series as the arithmetic mean 
of all available rates of ice-sheet mass balance derived from the same satellite tech- 
nique at each available epoch. We used the individual ice sheets and their integrals 
as common geographical regions. The maximum duration of the overlap period is 
limited to the 14-year interval (2002-2016) when all three satellite techniques were 
optimally operational. However, we also considered the availability of mass-balance 
datasets, which leads us to select the period 2003-2010 as the optimal interval (see 
Fig. 1). When the aggregated mass-balance data from all three experiment groups 
are degraded to a common temporal resolution of 36 months, the time series are 
on average well correlated (0.5 < r* < 0.9) at the APIS and WAIS. At the EAIS, 
however, the aggregated altimetry mass-balance time series are poorly correlated 
(? < 0.1) in time with the aggregated gravimetry and input-output-method data. 
Possible explanations for this include the relatively high short-term variability in 
mass fluctuations in this region, the relatively low trend in mass and the hetero- 
geneous temporal resolution of the aggregated altimetry dataset. Over longer 
periods, marked increases in the rate of mass loss from the WAIS are also recorded 
in all three satellite datasets. 

Because the comparison period is long in relation to the timescales over 
which SMB fluctuations typically occur, their potential effect on the overall 
inter-comparison is reduced. The closest agreement between individual estimates 
of ice-sheet mass balance occurs at the APIS and the WAIS, where the stand- 
ard deviation across all techniques is 15-41 Gt yr! (Extended Data Table 4). 
The greatest departure occurs at the EAIS, where the input-output-method and 
gravimetry estimates of mass balance differ by about 80 Gt yr! and the standard 
deviation of all three estimates is about 40 Gt yr’. This high degree of variance is 
expected because of the relatively large size of the region, the small amplitude of 
signals and the poor independent controls on coastal SMB. When compared to 
the inter-technique mean and standard deviation, all estimates of ice-sheet mass 
balance determined from the individual satellite techniques are now in agreement, 
given their respective uncertainties. In contrast to the first IMBIE assessment’, 
this finding also now holds at continental and global scales. We therefore conclude 
that estimates of mass balance determined from independent geodetic techniques 
agree when compared to their respective uncertainties. 

Several noteworthy patterns in the distribution of mass-balance estimates deter- 

mined during the overlap period (2003-2010) merit further discussion. Estimates 
of mass balance derived from satellite altimetry and gravimetry agree to within 
15 Gt yr! on average and with the mean of all three techniques, in all ice-sheet 
regions. By contrast, estimates of mass balance determined from the input-output 
method are 55 Gt yr! more negative on average than the mean in all ice-sheet 
regions. However, despite the bias, the input-output-method estimates remain in 
agreement because their estimated uncertainties are relatively large (approximately 
three times larger than those of the other techniques). A more detailed analysis 
of the primary and ancillary datasets is required to establish whether this bias is 
significant or systematic. 
Ice-sheet mass-balance integration. We combined estimates of ice-sheet mass 
balance derived from each geodetic-technique experiment group to produce a 
single, reconciled estimate, following the same approach as for the first assessment. 
This estimate was computed as the arithmetic mean of the average rates of mass 
change from each experiment group, within the regions of interest and at the time 
periods for which the experiment-group mass trends were determined. We esti- 
mated the uncertainty of the mass-balance data using the following approach. 
Within each experiment group, we estimated the uncertainty of mass trends as the 
average of the errors associated with each individual estimate and the uncertainty 
of reconciled rates of mass change (see, for example, Table 1) as the root-mean- 
square of the uncertainties associated with mass trends from each experiment 
group. When summing mass trends of multiple ice sheets, the combined uncer- 
tainty was estimated as the root-sum-square of the uncertainties for each region. 
Finally, to estimate the cumulative uncertainty of mass changes over time, we 
weighted the annual uncertainty by 1/./n, where n is the number of years 
since the start of each time series, and summed the weighted annual uncertainties 
over time®”. 

Across the full 25-year survey, the average rate of mass balance of the AIS was 
—109 + 56 Gt yr! (Table 1). To investigate inter-annual variability, we also cal- 
culated mass trends during successive five-year intervals. Whereas the APIS and 
WAIS each lost mass throughout the entire survey period, the EAIS experienced 
alternating periods of mass loss and mass gain, probably driven by inter-annual 
fluctuations in SMB. The rate of mass loss from the WAIS has increased over 
time owing to accelerated ice discharge in the Amundsen Sea sector??47"7381-83, 
The largest increase—a doubling of the rate of ice loss—occurred between the 
periods 2002-2007 and 2007-2012 (Table 1). Overall, the WAIS accounts for 
the vast majority of ice-mass losses from Antarctica. At the APIS, rates of ice-mass 
loss since the early 2000s are notably higher than during the previous decade, 
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consistent with observations of surface lowering’»”* and increased ice flow in 


southerly glacier catchments*. The approximate state of balance of the wider EAIS 
suggests that the reported dynamic thinning of the Totten and Cook glaciers***° 
has been offset by accumulation gains elsewhere®”. 
Data availability. The final mass-balance datasets generated in this study are freely 
available at http://www.imbie.org/data-downloads. 
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Extended Data Fig. 1 | Datasets of ice-sheet mass balance included in our assessment. Details about the datasets are provided in Supplementary 
Table 1. Some datasets did not encompass all three ice sheets. 
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Extended Data Fig. 2 | Ice-sheet drainage basins. AIS drainage basins 
are determined according to the definitions of ref. 3 (eft) and refs 1° 
(right). Basins that fall within the Antarctic Peninsula, West Antarctica 
and East Antarctica are shown in green, pink and blue, respectively. For 
the definition from ref. *, the Antarctic Peninsula, West Antarctica and 


East Antarctica basins cover areas of 227,725 km/?, 1,748,200 km? and 
9,909,800 km?, respectively. For the definition from refs *!°, the Antarctic 
Peninsula, West Antarctica and East Antarctica basins cover areas of 
232,950 km’, 2,039,525 km? and 9,620,225 km?, respectively. 
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Extended Data Fig. 3 | Temporal variations in AIS SMB. We show time 
series of integrated SMB in AIS drainage regions”!” from the MARv2.6 
(blue) and RACMO2.3p2 (red) models. Solid lines are annual averages of 
the monthly data (dashed lines). mo, month. 
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Extended Data Fig. 4 | Modelled GIA beneath the AIS. a, Bedrock uplift rates in Antarctica averaged over the GIA model solutions used in this 
assessment. b, The corresponding standard deviations. 
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Extended Data Fig. 5 | Individual rates of ice-sheet mass balance. Antarctica (c, f, i). The light-grey shading shows the estimated lo 

a-i, Mass-balance estimates were determined from satellite altimetry uncertainty relative to the ensemble average. The standard error of the 
(a-c), gravimetry (d-e) and the input-output method (g-i) for the mean solutions, per epoch, is shown in mid-grey. 
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Extended Data Table 1 | Spatially averaged AIS SMB 


Area SMB 

RACMO2.3p2 RCM 12.30 27km 2107 
MARv3.6.40 RCM 12.32 35km 2150 
ERA-Interim GCM 12.20 80km 1900 
JRA55 GCM 12.24 55km 1807 


Estimates of the average SMB over the period 1980-2012 were derived from regional climate models (RCM) and global reanalyses (GCM). Data were evaluated using the drainage basins from refs 
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Extended Data Table 2 | GIA model details 
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4Main publication®*'6 listed; supporting publications are provided in Supplementary Table 1. 
’Model from main publication unless otherwise stated. Comma-separated values refer to properties of a radially varying (1D, one-dimensional) Earth model: the first value is lithosphere thickness 
(km); other values reflect mantle viscosity (x 10?! Pa s) for specific layers; see relevant publications for details. 
“Ice model covers at least the Last Glacial Maximum to present, unless otherwise indicated. 


4GIA model details: SH, spherical harmonic (maximum degree in 
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‘Different to ICE-6G_C in Antarctica, owing to the use of BEDMAP2! topography in that region. 
&Model relates to GIA in the northern Antarctic Peninsula (nAPIS) only. 
5Model relates to GIA in the Amundsen Sea (AS) embayment only. 


‘Earth model from ref. 24. 
‘Ice model from ref. °°. 
KGIA model from ref. 8°. 
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Extended Data Table 3 | Features of mass-balance datasets included in our assessment 


dM/dt 
Temporal dM/dt 
dM/dt range standard 
Region Technique Span (years) resolution error 
(Gt/yr) deviation 
(months) (Gt/yr) 
(Gt/yr) 


APIS Gravimetry | 2005 to 2015 1 to 24 10 
WAIS | Gravimetry | 2005 to 2015 -177 to-114 | 1to30 16 


Gravimetry 2005 to 2015 1to 12 +11 to +107 2 to 35 


Altimetry 1992 to 2017 1 to 8.25 -29 to -3 2 to 17 


Altimetry 1992 to 2017 1 to 8.25 -97 to -25 4 to 39 


Altimetry 1992 to 2017 1 to 8.25 -11 to +136 10 to 52 


Mass Budget 1992 to 2016 -120 to +20 30 


Mass Budget 1992 to 2016 -250 to +100 


EAIS | Mass Budget | 1992 to 2016 -200 to +200 65 95 


Details shown include the maximum span, temporal sampling, amplitude, estimated error and standard deviation at each epoch. 
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Extended Data Table 4 | Aggregated estimates of ice-sheet mass balance from satellite altimetry, gravimetry and the input-output method 


Altimetry mass | Gravimetry mass Mass budget Average mass 
balance (Gt/yr) balance (Gt/yr) mass balance balance (Gt/yr) 
(Gt/yr) 
-35+65 15+41 
-115 + 43 -93 + 26 
-51+24 -27+15 
-201 + 82 -105+51 


In this comparison, the data were averaged over the period 2003-2010. The arithmetic mean of each individual result is also shown for the given regions, along with the combined imbalance of the 
AIS, calculated as the sum of estimates from the constituent regions. 
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Trends and connections across the 
Antarctic cryosphere 


Andrew Shepherd!*, Helen Amanda Fricker? & Sinead Louise Farrell? 


Satellite observations have transformed our understanding of the Antarctic cryosphere. The continent holds the vast 
majority of Earth’s fresh water, and blankets swathes of the Southern Hemisphere in ice. Reductions in the thickness 
and extent of floating ice shelves have disturbed inland ice, triggering retreat, acceleration and drawdown of marine- 
terminating glaciers. The waxing and waning of Antarctic sea ice is one of Earth’s greatest seasonal habitat changes, and 
although the maximum extent of the sea ice has increased modestly since the 1970s, inter-annual variability is high, and 


there is evidence of longer-term decline in its extent. 


t the height of austral winter, Antarctica and the surrounding 

ocean are covered in a 31.6 million km? cap of ice (Fig. 1). Of 

this, approximately 18.5 million km? is formed as sea ice when 
the ocean freezes', 11.9 million km? is a near-permanent ice sheet resting 
on land or the sea floor? and 1.6 million km? is contained within long- 
lived ice shelves that are floating extensions of the continental ice’. All 
of Antarctica’ ice is mobile, driven by gravity and, where it is afloat, by 
the atmosphere and the ocean (Fig. 1). Each element plays a unique role 
in the climate system; for example, the grounded ice is Earth’s primary 
freshwater reservoir‘, the ice shelves are a major source of ocean fresh 
water?, and the sea ice is an important factor in the planetary albedo*. 

The greatest fluctuation in the extent of ice cover in the Southern 
Hemisphere is due to the seasonal cycle of sea ice formation, which is 
less than a metre thick on average’, and reduces to one-sixth of its peak 
area in summer®. The decadal trend in Antarctic sea ice extent has, never- 
theless, been modest’, and the most striking contemporary changes have 
occurred in other elements of the regional cryosphere. For example, the 
grounded ice sheet is estimated to have lost 2,720 + 1390 Gt of its mass 
between 1992 and 2017”, and its peripheral ice shelves are thinning in 
numerous sectors*!!~!? and collapsing'*'° at the Antarctic Peninsula. 
These trends reflect global and regional environmental forcing and are 
related through a variety of processes, each of which is now better under- 
stood thanks to the array of satellite observations that has been acquired 
over recent decades. 

Here we analyse the satellite record to examine continental- and region- 
al-scale trends in the Antarctic cryosphere, including fluctuations in the 
extent, thickness and movement of sea ice, ice shelves and the grounded 
ice sheet. We show that spaceborne measurements have allowed key 
events in Earth’s recent climate history to be charted in remarkable 
detail—including the collapse of ice shelves at the Antarctic Peninsula and 
the drawdown of glacier ice from West Antarctica—and have illuminated 
the key processes that are driving contemporary change. 


Grounded ice 

Fluctuations in the mass of the Antarctic ice sheet arise from differences 
between the net snow accumulation and ice discharge. In recent dec- 
ades, a variety of techniques have been developed to measure changes 
in the speed, elevation and weight of the grounded ice. Airborne radar 
measurements show that the Antarctic ice sheet is up to 4,897 m thick, 
and has the potential to raise global sea level by 58 m were it to be 


rapidly discharged’. It overlies terrain of variable geology and relief, and 
this has influenced both its formation and its contemporary dynamics. 
The continental-scale pattern of ice flow was first inferred from car- 
tographic’* and, more recently, satellite altimeter’” records of the ice 
sheet surface elevation. On this basis, it has been determined that most 
of Antarctica’s ice is routed into the Southern Ocean through around 
30 glaciers and ice streams (Fig. 1), each draining a substantial inland 
catchment’. 


Grounded ice imbalance 

The stability of Antarctica’s ice can be assessed by tracking the move- 
ment of its principal glaciers and ice streams. Although such glaciers 
and ice streams are few in number, they are vast, putting this task 
beyond the scope of ground surveys. The first remote measurements 
of ice motion were made possible by repeat satellite optical imagery’ 
and, subsequently, by synthetic aperture radar interferometry’®. Thanks 
to step increases in the quantity of satellite image acquisitions over 
time, systematic surveys of ice flow across and around the continent 
have now been completed”°, revealing anomalous behaviour in much of 
Marie Byrd Land?! and also at isolated sites at the Siple Coast”’, at the 
Antarctic Peninsula**~*°, and in East Antarctica?”?*. In most of these 
places, the pace of ice flow has increased during the satellite era, and, 
when considered as a whole, the rate of ice discharge from Antarctica 
exceeds inland snow accumulation”. 

In addition to changes in ice discharge, fluctuations in ice sheet mass 
can be detected through satellite measurements of the volume*”* and 
gravitational attraction**™ of the ice sheets. Although all three methods 
lead to similar results at the continental scale, each approach has its 
merits, and they are now viewed as being complementary. So far, there 
have been over 150 individual assessments of ice loss from Antarctica 
based on these approaches”? and, when collated!®*°, these studies show 
that the continent has contributed 7.6 + 3.9mm to global sea levels 
since 1992. Two-fifths (3.0 + 0.6 mm) of this loss occurred during the 
past five years'®. Although the rate of ice loss from the entire Antarctic 
ice sheet has changed little during the satellite record, speedup of glacier 
flow in the Amundsen Sea sector has led to accelerated losses from this 


Satellite radar altimetry is an especially powerful tool for ice sheet gla- 
ciology, because the technique can be used to resolve the detailed pattern 
of imbalance across individual glacier catchments (for example, ref. 22). 
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Fig. 1 | Average annual motion of the 
Antarctic ice sheet and ice shelves, and of 

the surrounding sea ice in winter. The ice 
sheet is drained by around 30 principal flow 
units and the sea ice transport is generally 
northwards, with gyres in the Ross and Weddell 
seas. Grounded-ice and ice-shelf motion are 
derived from multiple satellite interferometric 
synthetic aperture radar data acquired”’ between 
2007 and 2009. Ice sheet motion flowlines 

are superimposed on the MODIS mosaic of 
Antarctica’”’. Imagery from the NASA MODIS 
instrument, courtesy NASA NSIDC DAAC. 

Sea ice motion is the mean of daily gridded 
Polar Pathfinder radiometry obtained during 
peak winter (September) of each year in the 
period’? 1990 to 2016. Sea ice motion vectors 
are superimposed on a map of mean sea ice 
concentration derived from passive microwave 
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around much of the continent, with monthly sampling, and over 
multi-decadal periods (Fig. 2). This allows signals of short-term var- 
iability to be separated from longer-term trends. Although most of 
Antarctica has remained stable over the past 25 years, there are clear 
patterns of imbalance in many coastal sectors—such as the thickening 
of the Kamb Ice Stream and the thinning of glaciers flowing into the 
Amundsen Sea and at the Antarctic Peninsula. These changes reflect 
imbalance between ice flow and snow accumulation within the sur- 
rounding catchments. The pace of ice flow at the Kamb Ice Stream is 
unusually low” and has not altered in recent decades, but analysis of 
ice-penetrating radar measurements"! shows that it stagnated over a 
century ago. Elsewhere, inland glacier thinning is almost exclusively 
coincident with contemporaneous ice speedup”! (indicating that 
the thinning is dynamic in nature) and with perturbations at the marine 
termini of the glaciers (indicating that the thinning has resulted from 
ocean forcing). 


Active subglacial lakes 

A surprising application of satellite observations has been the mon- 
itoring of movement of water beneath the Antarctic Ice Sheet. Over 
three hundred subglacial lakes—bodies of liquid water at the ice sheet 
base—have been discovered in Antarctica (Fig. 1) using ice-penetrating 
radar”, and these were at first considered to be isolated and stable res- 
ervoirs. However, localized and episodic rises and falls of the ice sheet 
surface were then spotted in satellite interferometric*® and altimetric 
records*”8, suggesting otherwise. These fluctuations, amounting to 
changes in height of 1-10 m over sub-decadal timescales, are inter- 
preted to be the surface expressions of water transferring between active 
subglacial lake networks. More than a hundred active lakes have now 
been identified using this approach”, and monitoring of their evolu- 
tion has led to improved understanding of how Antarctic subglacial 
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brightness temperatures!“* in September 1990 

to 2016. Also shown are the average minimum 
extent of sea ice recorded’ between 1990 to 
2016 (blue dashed boundaries), the grounded 

ice sheet and the floating ice shelves (black 
boundaries), and the bathymetry of the 
surrounding ocean!*>. Active subglacial lakes 
(light blue) were mapped using satellite radar and 
laser altimetry*!. 


concentration (%) 


water systems evolve, and the consequences of this variability”. At the 
Whillans, Mercer and Recovery ice streams, the Crane and Byrd gla- 
ciers, and in eastern Wilkes Land, for example, more than a decade of 
satellite measurements have been acquired®!. Thanks to these data, we 
now know that in addition to periodically flushing subglacial cavities, 
the presence” of and fluctuations?’ in subglacial lake water can lubri- 
cate ice flow in parts of the continent. 


Ice shelves 

When Antarctic glacier ice reaches the ocean it often remains intact, 
forming floating ice shelves in sheltered embayments. All together 
there are more than 300 Antarctic ice shelves, fringing three-quar- 
ters of the continent and extending the ice sheet area* by some 13%. 
Their average thicknesses range*’ from 300 m to 2,500 m, and peak at 
the grounding line, where the shelves are fed by inland glaciers. Ice 
shelves can provide mechanical support for the grounded ice sheet 
upstream, through contact with confining side walls or sea mounts™. 
Downstream, they thin as the ice spreads, and they gain and lose addi- 
tional mass primarily through snow accumulation, iceberg calving 
and basal ice melting. Basal melting is driven by several processes” 
including the formation of high-salinity water during winter sea ice 
growth, tidal mixing of seasonally warm water, and the intrusion 
of warm ocean currents into sub-shelf cavities. Meteorological and 
oceanographic conditions can also lead to surface melting and basal 
ice freezing. In some cases, it can take more than a thousand years for 
ice to travel through Antarctic ice shelves from the grounding line to 
the calving front®, and geological records show that they have been 
a persistent element of the climate system throughout the Holocene 
period (for example, see ref. °°). Their dependence on a wide range 
of factors makes ice shelves a sensitive indicator of environmental 
change*”. 
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Fig. 2 | Average trend in the elevation and thickness of Antarctic 
grounded ice and ice shelves, respectively, determined between 1992 and 
2017 north of 81.5° S (dashed grey circle), and between 2010 and 2017 
elsewhere. Also shown is the depth of (from the PANGEA database in the 
supplement to ref. '“°) and estimated ocean temperature!” at the sea floor 
around the continent. Changes in grounded-ice and ice-shelf thickness 
were estimated using repeat satellite altimetry following the methods of 
refs *3° and ‘48. Thickness trends are superimposed on an optical image 
mosaic of the floating and grounded ice’, and is divided (grey lines) 

into the principal ice drainage catchments”. Since 1992, the grounded 


Ice shelf imbalance 

Trends in ice shelf area, thickness and flow can be detected using a 
wide range of satellite sensors, and a host of other properties can be 
inferred from these measurements. Ice shelf area can be measured 
using optical and radar satellite imagery, and this has been used, for 
example, to chart long-term changes in their extent". A series of satel- 
lite radar and laser altimeter missions have provided near-continuous 
observations of ice shelf surface elevation for several decades, and 
these have formed the basis of ice shelf thickness*® and thickness 
change*'!° estimates on the assumption that the ice is buoyant within 
the surrounding ocean. These estimates require careful treatment of 
fluctuations in ocean tide’? and of changes in the firn column thick- 
ness, When combined, measurements of ice shelf area and thick- 
ness change allow the volume and mass trends of the ice shelves to 
be derived. Ice shelf flow can be monitored with repeat pass satellite 
optical®' and radar® imagery and, if contemporaneous changes in both 
the flow and thickness of ice shelves are available, the rate of steady 
state and net!*6+° basal ice melting can be determined. 

Analysis of ice shelf surface elevation measurements derived from 
multi-mission satellite altimetry (Fig. 2) has allowed the decadal 
mass change and principal environmental forcing mechanisms of ice 
shelves to be identified*!!°**°, Although the major Ross, Filchner- 
Ronne, and Amery ice shelves have remained stable since the 1990s, 
many ice shelves in West Antarctica have experienced long-term thin- 
ning over the same period. In the locations where retreat or thinning 


ice sheet and its peripheral ice shelves have thinned in locations adjacent 

to warm ocean currents. Although the East Antarctic Ice Sheet is mostly 
stable, there have been marked changes in West Antarctica, including 
accelerated thinning of glaciers draining the Amundsen Sea sector and 
constant thickening of glaciers in southerly catchments of the Siple Coast. 
This thinning is a response to ocean-driven melting of ice shelves at glacier 
termini’’, and the thickening is associated with stagnation of ice flow due to 
a loss of basal lubrication!’. At the Antarctic Peninsula, ice shelf collapse’4 
has triggered inland glacier acceleration*®®* and thinning’>. Imagery from 
the NASA MODIS instrument, courtesy NASA NSIDC DAAC. 


have occurred, the grounded ice inland has also been destabilized. 
The dominant control on this pattern is believed to be the presence 
(or absence) of warm ocean currents offshore”. Altogether, the volume 
of Antarctic ice shelves has declined through net overall thinning 
(166 +48 km? yr~! between 1994 and 2012; ref. '!) and through 
progressive calving-front retreat of those at the Antarctic Peninsula 
(210 +27 km? yr! between 1994 and 2008; ref. *). Combined, these 
losses amount to less than 1% of their volume. However, the high- 
est ice shelf thinning rates have occurred in the Amundsen and 
Bellingshausen seas’”, where five have lost between 10% and 18% 
of their thickness!’ owing to ocean-driven melting at their bases®”. 
The effects of the wider El Nifio-Southern Oscillation have also been 
detected alongside these longer-term trends”. 


Ice shelf collapse 

Ice shelves at the Antarctic Peninsula (Fig. 3) are especially vulnerable, 
because they are situated at the most northerly latitude on the continent, 
where temperatures are relatively high and summertime melting is 
common. In recent decades, a number of these ice shelves have disin- 
tegrated in part or entirely'*. Notable examples include the substantial 
(>70%) retreat of the Larsen B!>® and Wilkins™ ice shelves, and the 
effective collapse (>90%) of the Prince Gustav Channel”, Larsen 
A”!, and Wordie”, ice shelves. Since the 1950s, the combined area!*” 
of Antarctic ice shelves lost through retreat and collapse has been 
33,917 km”, or 22% of their original extent. Analysis of the geological 
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record*®”* has confirmed that the collapse events are unique during 
the Holocene period. 

The retreat and collapse of Antarctic Peninsula ice shelves has 
occurred in tandem with a rapid regional atmospheric warming hap- 
pening at several times the global trend*’. These events have been 
linked; warmer air temperatures lead to intensified surface melting”> > 
which is believed to cause hydraulic fracture of surface crevasses fol- 
lowed by ice shelf collapse’®. Several Antarctic Peninsula ice shelves 
have also thinned in the decades leading up to their collapse’»!>>?, 
primarily through ocean-driven melting at their base. This thinning 
may contribute to instability by weakening ice shelf lateral margins 
before fracture’’, and by enhancing rates of iceberg calving’”®. The rela- 
tionship is, however, not universal; for example, although the Wilkins 
ice shelf collapsed in 2009, it did not thin in the preceding five years”. 
Indeed, recent satellite altimetry®° shows that the surface elevation of 
the Larsen C ice shelf increased in the preceding decade, in response to 
cooler (and not warmer) summertime temperatures. And although the 
observational evidence suggests that stability of Antarctic ice shelves 
depends also on their thickness, geographical location or setting, the 
extent to which those shelves that are partially or wholly intact will 
continue to resist collapse remains uncertain. 

The collapse of ice shelves does not contribute directly to sea level rise, 
because they are afloat. However, there is an indirect effect: observations 
show that the grounded tributaries to the Larsen A*! and Larsen B!** 
ice shelves did speed up in response to the removal of the floating ice, 
which is presumed to have offered resistance. So far, ice shelf retreat and 
collapse has been restricted to those shelves situated at the Antarctic 
Peninsula, in relatively warm climates, and has not threatened those far- 
ther south, on the fringes of the East and West Antarctic Ice Sheets. The 
largest recorded reduction in ice shelf area at the Antarctic Peninsula so 
far has been the calving of the 11,095 km? A-68 tabular iceberg (Fig. 3) 
from the Larsen C ice shelf”? in 2016. However, this iceberg represented 
just 7% of the ice shelf’s area, and was similar in size to one (the A-20 
iceberg) that broke free®’ in 1986. It is, therefore, not without prece- 
dent—even during the relatively short satellite era—and there is as yet 
no evidence that either breakaway disturbed the remaining ice shelf, or 
was anything other than routine iceberg production. 


Buttressing of grounded ice 

The term ‘buttressing’ is used to describe the resistive forces imparted 
to a grounded ice sheet by its peripheral ice shelves. In its absence, 
rates of ice sheet discharge increase nonlinearly with ice thickness, 
making the grounding lines of marine-based ice sheets difficult to 
stabilize because their bedrock tends to deepen inland®’. Floating ice 
shelves can, however, exert drag as they flow over and around sea- 
mounts (pinning points) or as a result of their lateral confinement, 
and the extent to which this drag can mitigate unstable retreat has long 
been the subject of glaciological debate**. In recent years, the rapid 
response of Antarctic glaciers to the collapse and thinning of ice shelves 
at their termini has led to a reassessment of their resistive properties. 
At the Antarctic Peninsula, for example, glaciers flowing into the for- 
mer Larsen A, Larsen B and Wordie ice shelves have surged”+68.84 after 
their collapse, as too have ice streams flowing into the Amundsen 
and Bellingshausen seas”! in the wake of ice shelf thinning®!*”? 
and grounding-line retreat®**°, Although the events at the Antarctic 
Peninsula have been attributed to the destabilizing effect of increased 
melting at the surface of ice shelves, following regional atmospheric 
warming’, the events in the Amundsen and Bellingshausen seas are 
now firmly linked to enhanced ocean-driven melting at the bases of the 
ice shelves, owing to the intrusion of warm circumpolar deep water into 
the cavities beneath them®”*’, 

Although the reservoir of grounded ice at the Antarctic Peninsula is 
relatively modest!, destabilization of the Amundsen Sea sector glaciers 
is a matter of considerable concern, because the pace of ice drawdown 
during the satellite era has been swift®’, and because these glaciers con- 
tain enough ice to raise global sea levels by more than a metre*. Over 
the past two decades, for example, surface lowering has spread inland 


226 | NATURE | VOL 558 | 14 JUNE 2018 


2 Prince Gustav Channel (1,621 km?) 


af 
“Oy 


pe, AL eee km?) 
4 ( S$ Larsen B (9,166 km?) 


Cup 


Ry 


Fig. 3 | Temporal changes in the location of ice shelf barriers at the 
Antarctic Peninsula. The outlines of the ice shelves (coloured lines) are as 
determined from satellite imagery since the 1950s, and the net reduction 
in area over the same period is shown in parentheses'*”’. Pale (beige) areas 
are ice shelves, darker (brown) areas are grounded ice. The reduction in 
area at the Larsen C ice shelf includes the recent calving of the A-68 tabular 
iceberg. In the 1950s, the total area of Antarctic Peninsula ice shelves was 
estimated to be 152,246 km. Since then, an area of 33,917 km? has been 
lost during episodic calving events. 


across the drainage basins of the Pine Island and Thwaites glaciers at 
speeds of between 5 and 15km yr7', and the majority of their catch- 
ments are now in a state of dynamical imbalance (they are thinning 
owing to accelerated flow). This rapid spreading is a consequence of 
several connected processes® (Box 1): ice shelf thinning leads to initial 
reductions in sidewall and basal traction at glacier termini, which then 
causes increased strain rates (flow) within the glacier ice upstream, 
followed by further grounding-line retreat caused by the associated 
ice thinning—especially in marine-based sectors of the continent. 
Glacier speedup may also lead to further reductions in sidewall trac- 
tion through rifting and fracture, and grounding-line retreat can expose 
more ice to the ocean melting responsible for the initial imbalance. 
Glaciers flowing into the Amundsen Sea sector of West Antarctica 
are particularly susceptible to climate forcing, owing to their geomet- 
rical configuration and the absence of any substantial ice shelf 
barrier”, and today the pace of ice sheet retreat along parts of this 
coastline dwarfs that during the Holocene period. The region’s ice 
shelves have thinned!” by 3 to 6m yr7', and its glacier grounding lines 
have retreated*>*° by 10 to 35km since 1992, which is 20 to 30 times 
the rate since the Last Glacial Maximum, according to analysis of the 
marine geological record®!. In response to these perturbations, the 
grounded glaciers inland have sped up”)? and thinned***” at faster 
rates. For example, since the early 1990s, rates of ice flow at the Pine 
Island Glacier terminus have increased* by about 1.5km yr7! and rates 
of ice thinning have risen® to over 5m yr~', and the sector overall 
contributed 4.5 mm to global sea level rise** between 1992 and 2013. 
The forcing for these events is now widely regarded to lie in the sur- 
rounding ocean, because ice drawdown has originated at and evolved 
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The figure depicts ice shelf buttressing processes, external forcings and their connectivity. Ice shelves are floating sheets of ice that form as 
glaciers spread out into the ocean, typically within confined embayments. They are permanently attached to the grounded ice sheet resting on 
land, and they gain mass through ice flow across the grounding line and local snowfall on the surface, and lose mass by melting at their bases 
and iceberg calving. If warm water enters the ocean cavity beneath an ice shelf it can drive increased basal ice melting and ice shelf thinning, 
which in turn leads to retreat of the grounding line—the junction between grounded and floating ice on the sea floor. Ice shelf thinning 
reduces sidewall (lateral) traction and grounding-line retreat reduces basal traction. Both processes lead to speedup of the grounded ice, 
which causes grounded ice thinning. Glacier speedup can also lead to weakening of lateral shear margins and increased crevassing. Iceberg 
calving can also lead to reduced sidewall traction. Sea ice (frozen sea water) can play a part through brine rejection, which drives the 
production of warm high-salinity shelf water, and by providing an additional buffer that effectively increases buttressing. 
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from the terminus of neighbouring but distinct ice flow units”, and 
because warm®’ and warming“ water is present within the cavities 
beneath their peripheral ice shelves. According to numerical simula- 
tions, the Pine Island and Thwaites glaciers**** may contribute a further 
4mm or so to global sea levels over the twenty-first century in response 
to continued forcing, and it has been concluded” that the region is now 
undergoing marine ice sheet instability, with no geometrical obstacles 
to prevent irreversible decline. However, satellite observations have 
revealed that retreat of the Pine Island Glacier halted around 2011, 
and that ice thinning inland abated in the following years®*®. This sug- 
gests that the situation is more complicated than a consideration of the 
glacier geometry alone, and may involve changes in the degree of ocean 
forcing, as has occurred in the recent past®”””. 


Sea ice 

Antarctic sea ice forms as the Southern Ocean surface freezes, and it 
interacts with the neighbouring ice shelves and grounded ice in many 
ways (for example, Box 1). Satellite observations (for example, Fig. 1) 
have allowed us to map the extent®?§, thickness?”! and drift!0!1 
of the sea ice, providing insight into the effects it has on climate® and 
ecosystems!. In winter, Antarctic sea ice extends from an inner 
zone of consolidated pack ice surrounding the continent, to the mar- 
ginal ice zone near the powerful Antarctic Circumpolar Current, 
where floes are less concentrated (Figs. 1 and 4). In summer, the 
sea ice pack retreats to isolated pockets fringing the continent. As 
it forms, Antarctic sea ice produces high-salinity shelf water when 
brine is rejected, which then sinks to the seabed. This water drives 


Sea ice 
as | 


Iceberg 


Brine 
rejection 


Tidal 
mixing 


Circumpolar 
deep water 


buoyant plumes within the cavities beneath floating ice shelves, which 
melt glacier ice at the grounding line, before returning to the open 
ocean along their base. Antarctic sea ice is also characterized by local 
polynyas—persistent gaps in the ice cover (for example, Fig. 4) that are 
sustained by upwelling warm water, winds, tides and ocean currents. 
These polynyas are a source of bottom water (dense water occupy- 
ing depths typically below 4,000 m), and provide a link between the 
ocean and atmosphere that affects weather and wildlife!®’. Landfast 
sea ice can also act to stabilize ice shelves and glacier tongues'™, and to 
suppress!° or—upon its breakup—enable! iceberg calving. 


Sea ice extent and drift 

Fluctuations in the area of Antarctic sea ice have been routinely 
charted since the late 1970s using passive microwave satellite imagery®. 
Annually, its average extent ranges from 3.1 x 10° km? in February to 
18.5 x 10° km? in September’ (for example, Fig. 1). In contrast to the 
Arctic, where the area of sea ice has declined progressively!, there has 
been a small, positive increase (1.6 + 0.4% per decade between 1979- 
2016) in the hemispheric sea ice extent of the Southern Ocean’. This 
trend runs counter to the projections of most climate models’”’, and 
has occurred alongside a slow warming (0.02 °C per decade since the 
1950s) of the Southern Ocean!®*. Despite the trend, there is some evi- 
dence of longer-term decline: reanalysis of early satellite records!°'!° 
and historical whale catch positions''! suggest there may have been 
more ice cover in the 1960s and early 1970s than there is today. In 
recent years, however, extreme changes have occurred—the extent of 
Antarctic sea ice reached record maxima in three successive winters 
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Fig. 4 | Sea ice in the Weddell Sea in early November 2017, based on 
a composite of Sentinel-1 synthetic aperture radar imagery, MODIS 
optical satellite imagery and ASCAT scatterometer data. The satellite 
data were acquired during the austral winter when the ice cover is close to 
its maximum extent, stretching beyond the tip of the Antarctic Peninsula 
into Drake Passage. The composite reveals details of the diffuse ice 

cover in the marginal ice zone along the boundary with the open ocean, 
and the more compact, consolidated ice cover farther south. A large 
(approximately 35,000 km’) ice-free area is visible near Maud Rise (66° S, 
5° E); this polynya is formed by thermally driven convection in the 


(2012, 2013 and 2014 ref. !!), followed by a record summertime min- 
imum!"3!4 in March 2017. 

Changes in both atmospheric and oceanic forcing affect the extent 
of Antarctic sea ice'!*''®, and although total hemispheric extent has 
shown little overall change, there have been considerable regional var- 
iations'!”18, Although the Weddell Sea, Indian Ocean and the western 
Pacific Ocean have all seen modest trends (1.7% + 0.8%, 1.7% £0.99% 
and 1.8% + 1.2% per decade, respectively) in sea ice extent during the 
satellite era (1979-2016), there have been more substantial trends 
(3.3% £0.9% and —2.9% + 1.4% per decade, respectively) in the Ross 
Sea and the Amundsen and Bellingshausen seas’. The periods during 
which the western Ross and Bellingshausen seas are ice-free in summer 
have also changed, decreasing and increasing by two and three months, 
respectively!!8, between 1979 and 2011. Seasonal and decadal trends 
in the Weddell and Ross seas (positive) and in the Amundsen and 
Bellingshausen seas and the western Pacific Ocean (negative) reflect the 
influence of atmospheric forcing!!’. Although these fluctuations in sea 
ice extent are strongly correlated with the dominant modes of Southern 
Hemisphere climate variability''”'”, other factors are involved’”°. A 
range of mechanisms have been explored, including changes in oce- 
anic variability!”!, atmospheric circulation!?”'”’, stratospheric ozone 
depletion’, meridional wind forcing!” and freshwater input from ice 
shelf melt!?>!”°, 

Understanding the role sea ice plays in the Antarctic climate 
system also requires a consideration of its dynamics'””. In the Southern 
Ocean, sea ice drifts northwards and diverges under the influence of 
winds and ocean currents (Fig. 1), and the fraction of open water is 
higher than in the Arctic’”*. Satellite observations have illuminated 
both local-scale!?”!5° and hemisphere-wide!®! 13! Antarctic sea 
ice dynamics. Strong, circumpolar, westerly winds drive sea ice east- 
wards in the outer zonal band, while a nearly continuous westward 
circumpolar flow exists along the coastal boundary’. Persistent 
atmospheric lows centred at the boundaries of major ocean basins 
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water column, owing to the interaction of ocean currents and seafloor 
topography. Coastal polynyas are visible along the edges of the Filchner- 
Ronne, Brunt and Stancombe-Willis ice shelves, where openings in the 

ice cover are driven by katabatic wind, tides and ocean currents. New sea 
ice, which rejects brine during formation, is continually produced in these 
polynyas, making polynyas a source of dense, high-salinity deep water, 
which plays an important part in the global thermohaline circulation. Also 
visible is the large, tabular A68 iceberg (located at about 68° S, 60.5° W), 
which calved from the Larsen C Ice Shelf on 12 July 2017, and is now adrift 
in the western Weddell Sea. 


are the dominant drivers of sea ice motion, and these sustain large- 
scale gyres in the Weddell and Ross seas!*. The speed of sea ice drift 
in the eastern Weddell and Ross seas has increased, in contrast to 
the western Weddell Sea, where it has decreased'©”!*”, though these 
signals are still small compared to the interannual variability!”'. The 
general northward trajectory of the Antarctic sea ice pack also affects 
its age and thickness; rarely does it survive for more than two years, 
and the average thickness of floes (typically in the range 0.6-1.2 m) and 
pressure ridges are smaller than in the Arctic Ocean’. Locally, katabatic 
winds, tides and ocean currents sustain coastal polynyas through sea 
ice drift around the continent (for example, Fig. 4), providing a link 
between the sea ice pack and the ice sheet through their initiation of 


plumes beneath floating ice shelves!¥?. 


Summary and outlook 

In just three decades, satellites have transformed our appreciation of the 
extent and pace of change in the Antarctic cryosphere. Despite being 
remote, fluctuations in its ice cover have a global impact. The conti- 
nent holds Earth’s primary freshwater reservoir* and, together with its 
surrounding ice shelves* and sea ice’, blankets 6% of the planet in ice 
during the austral winter. Although persistent ice shelves have fringed 
Antarctica for thousands of years®, there is now widespread evidence of 
changes in their extent!“ and thickness*!!. Altogether, their volume has 
decreased by more than 300 km? yr! since 1994*", notably due to col- 
lapse and calving at the Antarctic Peninsula and rapid thinning of those 
in the Amundsen and Bellingshausen seas. These events have triggered 
retreat®.®° and acceleration”! of marine-terminating glaciers and ice 
streams around the continent, leading to the drawdown of ice from 
their inland catchments**”. Since 1992, the grounded ice sheet has lost 
1,350 + 1,010 Gt of ice, causing a net 3.8 + 2.8mm contribution to global 
sea level rise**. The waxing and waning of Antarctic sea ice influences 
the planetary albedo, oceanic circulation, marine productivity and 
ecosystems®!°. Although its extent has increased by 1.6% + 0.4% per 
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Fig. 5 | Schematic of a sea ice floe as observed by CryoSat-2 and 

AltiKa. Floe thicknesses are typically derived from measurements of 

their freeboard (the portion protruding above the ocean surface), an 
estimate of the snow loading and the principle of buoyancy”. CryoSat-2 
and AltiKa operate at different radar frequencies, and their echoes scatter 
from locations near to the lower and upper boundaries of the snow layer, 
respectively'*?!4°. Measurements acquired at both frequencies could 
provide a direct measurement of the snow loading, improving the certainty 
of sea ice floe thickness estimates. 


decade since 1979”, there are large regional variations®!!”"'8, and there 


is evidence from historical records!!! ofa longer-term decline. These 
discoveries, and many more, have transformed our understanding 
of the state of Antarctic ice. 

Even though considerable progress has been made during the sat- 
ellite era, particularly on understanding the ice sheet, key questions 
remain unanswered. For example, the detailed pattern of glacier change 
at the Antarctic Peninsula is not well known, because the rugged ter- 
rain poses a challenge for traditional remote sensing methods. Though 
modest, the mass balance of the East Antarctic Ice Sheet nevertheless 
remains uncertain, because its detection is complicated by uncertainties 
in rates of snowfall and glacial isostatic adjustment. And the evolution 
and impacts of abrupt subglacial lake drainage events is poorly defined, 
because frequent measurements of ice elevation and flow changes are 
often lacking at the local scale. But understanding the thickness of sea 
ice across the Southern Hemisphere and the nature of ice shelf collapse 
and retreat are pressing concerns. Even though the range of parameters 
that can now be measured on grounded ice and on ice shelves may be 
considered comprehensive, available satellite observations are of insuf- 
ficient spatial and temporal sampling to fully understand the nature and 
evolution of the processes that are driving contemporary imbalance. 
Although the satellite altimeter record has been used to resolve Southern 
Ocean dynamics, determining sea ice thickness—a key measure 
of its volume and longevity—from measurements of its freeboard 
(the portion protruding above the ocean surface) are hampered by poor 
knowledge of snow loading and its impact on the satellite retrieval. 

In the case of Antarctic sea ice, uncertainties in the degree of radar 
penetration into the snowpack!**:!* has so far limited the use of the 
25-year radar altimeter record for measuring its thickness. Some 
advances have been made using laser altimetry?”!™, which scatters 
from the surface of the overlying snow, but continental-scale trends 
in Antarctic sea ice thickness and volume nevertheless remain elusive 
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owing to the paucity of in situ measurements. One way to tackle this 
problem is to exploit the relationship between the amount and rough- 
ness of snow on sea ice and its total thickness!¥°, an approach that may 
be realized with the launch of ICESat-2, which has a laser capable of 
detecting surface roughness and thickness!*”"*®. Another possibility is 
to combine freeboard measurements retrieved from different scattering 
horizons to estimate the snow load directly, for example using obser- 
vations acquired by the CryoSat-2 Ku-band and AltiKa Ka-band radar 
altimeters?"4° (Fig. 5) and, in the future, ICESat-2. New techniques 
are also emerging that will enable us to map the extent, type, age, drift 
and roughness of sea ice with fine resolution using synthetic aperture 
radar imagery. 

Over land ice, the record of ice sheet motion data are too sparse 
to determine whether changes in flow have occurred on sub-annual 
timescales across much of the continent. On this point, the outlook is 
promising thanks to the systematic acquisition plans of the Sentinel-1 
synthetic aperture radar and Landsat-8 optical imager missions. A 
key unanswered science question is how long it will take for the ice 
shelves that are currently thinning to reach a point whereby they are 
no longer providing effective buttressing for the grounded ice inland. 
To address this, observations are required with sufficient frequency 
to track the events themselves, which, in the case of ice shelf collapses, 
have taken place over months or even days'4. Although it is possible 
to monitor grounding-line migration with high precision using syn- 
thetic aperture radar interferometry*>**, the revisit period of satellite 
missions is currently too long for the technique to be effective over 
rapidly deforming ice, and so other methods—such as repeat satellite 
altimetry*®!*!—will need to be exploited to track this precursor to ice 
sheet dynamical imbalance. 

The past decade has been a golden era for satellite glaciology, with 
a host of different sensors in orbit simultaneously. However, measur- 
ing ice loss from Antarctica at the continental scale is today heavily 
reliant upon a single ageing mission—CryoSat-2—which, at 8 years 
old, has now more than doubled its planned lifetime, and the conti- 
nuity of passive microwave observations of sea ice concentration and 
extent remains uncertain. Given the societal importance of changes 
in ice cover and global sea level, support for current and planned 
observational platforms should remain a central goal of geoscience 
research—a goal that can only be achieved by concerted, long-term 
international collaboration. 
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Choosing the future of Antarctica 
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We present two narratives on the future of Antarctica and the Southern Ocean, from the perspective of an observer looking 
back from 2070. In the first scenario, greenhouse gas emissions remained unchecked, the climate continued to warm, and 
the policy response was ineffective; this had large ramifications in Antarctica and the Southern Ocean, with worldwide 
impacts. In the second scenario, ambitious action was taken to limit greenhouse gas emissions and to establish policies 
that reduced anthropogenic pressure on the environment, slowing the rate of change in Antarctica. Choices made in the 


next decade will determine what trajectory is realized. 


ntarctica, the most remote region on Earth, is intimately coupled 
to the rest of the climate system. Atmospheric and oceanic tele- 
connections communicate climate variations at low latitude to 
Antarctica and the Southern Ocean, influencing the polar atmosphere, 
ocean, ice sheet, sea ice and biosphere. Likewise, Antarctica and the sur- 
rounding Southern Ocean affect the rest of the globe. The amount and 
rate of sea level rise in the coming centuries depends on the response 
of the Antarctic Ice Sheet to warming of the atmosphere and ocean’. 
The Southern Ocean takes up more anthropogenic heat and carbon than 
the oceans at other latitudes, helping to slow the pace of atmospheric 
warming”’, The circulation of the Southern Ocean also sustains global 
marine productivity by returning nutrient-rich deep water to the surface 
and exporting nutrients to lower latitudes*. Given the profound influence 
of Antarctica and the Southern Ocean on sea level, climate, and marine 
ecosystems, change in the region will have widespread consequences for 
the Earth and humanity. From a political perspective, Antarctica and the 
Southern Ocean are among the largest shared spaces on Earth, regulated 
by the unique governance regime of the Antarctic Treaty System®, and 
embedded within and connected to broader global decision-making®”. 
We present, from the perspective of an observer in 2070, two narra- 
tives on 50 years of change in Antarctica. Each narrative highlights the 
long-term consequences of decisions made today. The 50-year times- 
cale reflects a period over which substantial differences between the 
two scenarios will develop and is within the lifetime of today’s children. 
In the first, no meaningful action was taken to mitigate greenhouse 
gas emissions and global warming continued unabated. In the sec- 
ond, aggressive measures were taken to limit emissions, restrict global 
warming and increase resilience. We consider the ‘high emissions/ 
weak action’ and ‘low emissions/strong action’ narratives to be likely 
upper and lower bounds on the future trajectory of Antarctica and 
the Southern Ocean. The trajectory that plays out over the next 50 
years depends on choices made today. Cumulative emissions of CO2 
largely determine global mean surface warming’, so continued growth 
in emissions soon commits us to further unavoidable climate impacts, 
even if some of those impacts take decades or centuries to emerge fully*. 
Greenhouse gas emissions must start decreasing in the coming decade 
to have a realistic prospect of following the low emissions narrative’. 
We provide an integrated assessment of the associated trajectories 
for Antarctica and the Southern Ocean, spanning physical, biological 


and social dimensions. These trajectories describe plausible alternative 
futures rather than forecasts. Where possible, such as for some physical 
and chemical variables, we use quantitative projections from climate 
and ice sheet models. Where this is not possible (as for many biolog- 
ical and social systems), we anticipate future change using a heuristic 
approach based on process understanding and the known response 
to past changes (noting that the heuristic approach risks neglecting 
important nonlinearities or surprises in natural systems'"’). 

Of necessity, we offer the perspective of a single observer on 50 years 
of change in Antarctica, but we acknowledge that given the global 
diversity of human experience and values, other observers would 
interpret these changes through a different lens. We do not assume 
that individuals, institutions or society will act in a certain way; rather, 
we assess the possible consequences of particular courses of action (or 
inaction). Our goal is to initiate discussion and consideration of options 
for Antarctica’s future, and to highlight how the future of Antarctica is 
tied to that of the rest of the planet and human society. 


Antarctica in 2070 under high emissions 

Looking back from 2070, it is clear that the past 50 years have 
unfolded much as anticipated by the high emissions (Representative 
Concentration Pathway (RCP) 8.5) scenario used by the 
Intergovernmental Panel on Climate Change (IPCC) Fifth Assessment 
Report! published more than half a century ago. Growth in energy and 
food demand by the increased global population, supplied predomi- 
nantly by intensive agriculture supported by fossil fuel use and associ- 
ated with deforestation, drove an ongoing acceleration of greenhouse 
gas concentrations in the atmosphere and an increase in effective radi- 
ative forcing of about 5 W m~? compared to the pre-industrial period!. 
Lack of action to mitigate greenhouse gas emissions was accompanied 
by lack of regulation of the human presence in Antarctica. Both dis- 
tant and local human activities have left an indelible footprint on the 
Antarctic and Southern Ocean environment (Fig. 1). 


Change in the physical environment 

After 50 years of continued high greenhouse gas emissions, global mean 
surface air temperatures over land are now more than 3.5°C higher than 
observed in the late nineteenth century’ (Fig. 2), well above the sym- 
bolic ‘guardrail of 2°C introduced in international climate agreements 
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after the 2009 United Nations Framework Convention on Climate 


Change meeting in Copenhagen and reaffirmed by the nations of the 
world in a landmark agreement signed in Paris in December 2015. 


Atmosphere. Air temperatures over Antarctica have warmed! by 
about 3 °C, well above the range of centennial variations of the current 
interglacial period'’. In some low-lying regions of the continent, this 
increase in temperature has been sufficient to cause surface melt in 
summer. The increase in summer melt has had widespread impacts, 
contributing to the collapse of ice shelves, exposing new ice-free areas 
open to colonization by native and introduced plants, and altering the 
area suitable for nesting penguins and other birds. 

During the 30 years between 2015 and 2045, recovery of the ozone 
hole tended to push the westerlies further north, while increases 
in greenhouse gas concentrations caused a shift to the south. As a 
result, spring and summer wind patterns changed little over this time 
period, in stark contrast to the strengthening and poleward shift of 
the westerlies seen in summer in previous decades'*. During the 
winter season, when the ozone hole had little impact on the winds, 
the westerlies began to shift polewards and intensify after 2010 in 
response to increasing greenhouse gas concentrations in the atmos- 
phere. By around 2045, greenhouse gases had won out over ozone 
recovery even in summer, forcing the westerlies to shift south and 
strengthen in all seasons'* 


Southern Ocean. Southern Ocean surface waters have warmed every- 
where, reversing a slight cooling observed in the early twenty-first cen- 
tury at high latitudes, with an average increase! of 1.9°C south of 50°S. 
Surface waters have also freshened in response to increased precipita- 
tion and melt of sea ice and glacier ice'*"!”. Warming and freshening 
of surface waters have increased stratification and inhibited exchange 
with nutrient-rich deep waters. At 1,000 m depth, warming north of the 
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Antarctic Circumpolar Current has exceeded ocean temperature rise 
observed at other latitudes, reflecting the efficient uptake of anthro- 
pogenic heat by the overturning circulation’. Antarctic Bottom Water, 
the volume of which had already contracted by 50% between 1970 and 
2014'8, no longer exists. As a result of freshening, water sinking near 
Antarctica is no longer dense enough to qualify as Antarctic Bottom 
Water, although continued sinking of less-dense water maintains deep 
oxygen levels’”. In response to ongoing changes in surface winds, the 
subtropical gyres have shifted polewards, effectively shrinking the 
Southern Ocean””, 

The combined impact of changes in winds, warming and 
freshening has slightly strengthened the upper limb of the Southern Ocean 
overturning circulation”!. The Southern Ocean therefore continues to 
take up and export large amounts of heat and carbon dioxide, helping 
to slow the increase in global surface temperatures. Although uptake 
and export of heat by the overturning circulation initially delayed 
warming around Antarctica, surface sea and air temperatures in and 
around Antarctica are now warming at about the same rate as the global 
average, with larger warming in winter than in summer’. 

The chemistry of Southern Ocean waters continued to change 
in response to rising levels of atmospheric CO). The pH of surface 
waters south of 60° S decreased by 0.2 between 2017 and 2070, equiva- 
lent to a 50% decrease in the concentration of hydrogen ions since the 
pre-industrial period’. Southern Ocean surface waters south of 
60° S became under-saturated with respect to aragonite in winter by 
2040, and by 2070, >30% of the Southern Ocean surface waters south 
of 40° S had become under-saturated year-round!”. Hence, waters have 
become corrosive to shells and other biological structures made of this 
form of calcium carbonate. 


Ice shelves. Wind-driven changes in ocean currents resulted in an 
increase in ocean heat transport to the Amundsen Sea in the late 
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twentieth and early twenty-first century”? . Warmer ocean waters enter- 
ing the cavities beneath floating ice shelves drove higher rates of basal 
melting, thinning of ice shelves, and a reduction in the back-stress on the 
grounded ice upstream. The reduced buttressing increased the flow of 
the ice streams feeding the ice shelves and led to the retreat of grounding 
lines, including runaway retreat of glaciers grounded on bedrock that 
deepened inland**”. 

Although some ice shelves had supported extensive surface 
melting for decades®, summer air temperatures are now high enough 
to increase surface melt on large areas of the floating ice shelves”?-*’. 
The increased volume of surface melt, coupled with an increase in the 
temperature of the surface firn due to persistent refreezing of meltwa- 
ter and associated release of latent heat, has now led to the collapse of 
several ice shelves through hydrofracturing, a process first observed 
in Antarctica 70 years ago during the 2002 collapse of the Larsen B Ice 
Shelf*”. The Larsen C Ice Shelf collapsed by hydrofracture in 2055°3 
following several decades of thinning*’, and then several consecutive 
summers of excessive summer melting. 

Most of the ice shelves in the Amundsen Sea have thinned at an 
accelerating rate owing to increased ocean temperatures that caused 
higher basal melt rates in the sub-ice cavities. The Venable, Crosson 
and Dotson ice shelves were all lost between 2040 and 2050, quickly 
followed by the Thwaites ice shelf in 2060, as anticipated in 2015 from 
trends measured by satellites between 1994 and 2012™. Nearly a quar- 
ter of the volume of Antarctica’s ice shelves has been lost in the past 
50 years*?. Loss of sea ice also contributed to a decrease in buttressing 
of ice shelves*’. Totten Glacier, an outlet for a large ice-sheet drainage 
basin in East Antarctica, has undergone thinning and retreat driven by 
warm water* accessing the grounding line. Changes in ocean currents 
have led to warmer ocean water entering ice shelf cavities (as predicted 
for the Filchner Ronne Ice Shelf*”), causing ice-shelf thinning and the 
retreat of the grounding line across ice streams identified previously 
as being particularly vulnerable to such change in both the West** and 
East Antarctic*’ ice sheets. 

The large number of icebergs produced by collapsing ice shelves all 
around Antarctica” is now carefully monitored to manage the risks to 
the greatly expanded fishing, tourism and commercial shipping fleets, 
and Antarctic operations by Antarctic Treaty nations. 


Antarctic Ice Sheet and sea level. Fifty years ago, mass loss from the 
ice-sheet margins of West Antarctica was partially compensated by 
mass gain due to increased snowfall over East Antarctica*)”, facili- 
tated by more frequent intrusions of marine air as the westerlies shifted 
south’. However, the increase in ocean-driven melting could not be 
balanced by enhanced accumulation after 2020, leading to unequivocal 
loss of mass from the Antarctic Ice Sheet. 

Observations of grounding line locations and ice stream velocities 
now confirm that the ‘marine ice sheet instability’ is well underway 
in West Antarctica, as first proposed in the 1970s”4 and supported 
by observations and modelling in 2014”°-*”. The loss of back-stress 
after the disappearance of ice shelves has led to increased flow of ice 
from the ice sheet to the ocean in both East and West Antarctica. 
Tall, unstable“ ice cliffs have begun to appear in places around the 
marine-terminating ice sheet margin, where ice shelves and glacier 
tongues that were more than 750 m thick at their grounding zones 
have been lost. Further collapse of the West Antarctic Ice Sheet is now 
irreversible”*?’, mainly through the rapidly retreating, 120-km-wide 
Thwaites Glacier. Mass loss from the Antarctic Ice Sheet has contrib- 
uted more than 27 cm of global sea level rise since 2000, as predicted 
decades earlier?**>. The rate of mass loss from Antarctica now exceeds 
5mm yr! (in sea level equivalent terms) and continues to accelerate. 
Antarctica now makes the largest contribution to the rise in global 
mean sea level, exceeding the contribution from thermal expansion, 
the retreat of mountain glaciers and melting of the Greenland Ice 
Sheet. The total rate of sea level rise is similar to rates during the last 
deglaciation (averaging 10-15 mm yr“')!. A commitment to multi- 
ple metres of sea level rise in the longer term is now irreversible, 
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consistent with early ice sheet model projections**°”” and sea-level 


reconstructions of past warmer worlds", including the Pliocene 
(3 million years ago, when atmospheric CO, concentrations were only 
400 parts per million by volume), and the most recent interglacial 
period (125 thousand years ago). High emissions over the past 
50 years have already committed us to more than 10 m of sea level rise 
in the longer term (that is, >3,000 years); if emissions continue on the 
present trajectory to a total of 5,120 Pg of carbon, we are committed 
to more than 50 m of sea level rise in 10,000 years, 80% of which will 
be contributed by the Antarctic Ice Sheet*”. Economic losses from 
the flooding of coastal cities already exceed US$1 trillion per year” 
as a result of the sea level rise of less than half a metre that occurred 
between 2000 and 2070!°. 


Sea ice 

Slow expansion of Antarctic sea ice in the decades preceding 2016 
encouraged, in some, a sense of complacency about the stability of 
Antarctic sea ice. Record low sea ice extent in 2016 served as a cau- 
tionary reminder that this past stability could not be taken for granted. 
Indeed, by 2045 Antarctic sea ice was in clear and sustained retreat. 
Winter sea ice extent has reduced by 40%, more than twice the retreat 
projected for 2070 by climate models’, consistent with the large sen- 
sitivity of winter sea ice extent to Antarctic warming inferred from 
palaeoclimate information*!. Summer sea ice extent has decreased by 
almost half and most of the continental margin is now regularly free of 
sea ice in February, enabling access by ship to previously inaccessible 
regions, increasing fishing activity and tourism. 


Change in biology and ecosystems 

Fisheries. As access to the Antarctic coast has become easier, Antarctic 
marine systems are being exploited by fisheries from many nations, 
with stocks in decline around the continent. While initial conservation 
actions, including new marine protected areas, provided some respite 
for toothfish (Dissostichus eleginoides and Dissostichus mawsoni)’, their 
long life cycles>’, illegal and unregulated practices*’, and increased 
interest in the species because of fishery collapses elsewhere™* quickly 
depleted their populations. Interest in the Antarctic krill Euphausia 
superba fishery has seen profound growth because of new technologies 
for product processing, and resource depletion elsewhere°’. Coupled 
with growing impacts of acidification on juvenile stages**, replacement 
of Antarctic krill by salps in many areas owing to sea ice changes”, 
and arise in the number of baleen whales” the fishery has substan- 
tially reduced stocks, at least as far as assessments show, though these 
have been compromised by the absence of an extensive research-based 
assessment””. In the ensuing scramble to make use of resources, the 
evidence-based management approaches implemented successfully 
by the Commission on the Conservation of Antarctic Marine Living 
resources (CCAMLR) were overwhelmed™®®, 


Predators. The decline of exploited fish and krill stocks affected 
the populations of predators, including declines of several popula- 
tions of penguin species®»’, though with complexity in some areas 
arising from declines in competition with exploited toothfish. Some 
Antarctic krill predators, such as black-browed and grey-headed 
albatrosses, have shifted their diet from Antarctic krill to mesope- 
lagic fish and squid in response, joining a large suite of myctophid 
feeders®, including fur seals®*. Fishery-related mortality of seabirds 
and seals has increased because of growth in mesopelagic fish and 
squid fisheries®’~®°. Overall, Southern Ocean fisheries are declining 
in value and their regulation is increasingly contentious, reflecting 
problems encountered globally in fisheries during the twentieth and 
twenty-first centuries. 


Community structure. Warming has also led to substantial changes 
in the composition of marine communities. In the decades following 
2017, regional trends of increase in some species and declines in others 
continued” 7!, More recently, unexpected regime shifts occurred, 
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caused by changing interactions among key species (such as between 
Antarctic krill, penguins, seals and baleen whales), catastrophic 
declines in some species, and in response to new phenomena such 
as transport of soil particles to the ocean by increased run-off of ice 
melt from the continent””?. Changes in resource availability further 
hastened Adelie and Chinstrap penguin range contractions*’”!. Gentoo 
penguins benefited initially, as early assessments predicted”*, but warm- 
ing to the north and fishery impacts have led to trailing-edge range con- 
tractions and the start of population declines in this species too. Ocean 
acidification further complicated community responses. While some 
species adapted to acidification (for example, Antarctic brachiopods’*), 
others were less able to do so (for example, pteropods”’), resulting in 
reorganization of communities”, compounded by changing ocean and 
atmospheric conditions”®. 


Terrestrial biota and invasive species. On land, melt and retreat 
of glaciers exposed new ice-free areas, particularly on the Antarctic 
Peninsula, the northernmost part of the continent”’. Antarctica’s only 
two native vascular plant species showed initial increases in populations 
and expanded their ranges widely on the peninsula, with Antarctic 
hair grass predominating*’. By 2050 this vegetation had come to 
include many other species too. Some are recognizable as widespread 
Antarctic invaders, especially the annual bluegrass or Poa annua, 
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Fig. 2 | Antarctica and the Southern Ocean in 2070, under ‘low 
emissions/high action’ (left) and ‘high emissions/low action’ 
(right) scenarios. Differences are relative to a 1986-2005 reference 
period. Differences (1) to (6) are atmosphere-ocean-ice differences, 
taken from model projections following low- and high-emissions 
scenarios, respectively. Data for differences (1), (3), (4) and (5) 

are from ref. !. Data for differences (2) and (6) are from ref. °°. 
Differences (7) to (11) are differences in ecosystem states and 
pressures, with ecosystem structure changing from the current 
situation to one characterized by new species and interactions. Data 
for difference (7) is from refs | and ”*; for (8) from refs *” and ®; for 
(9) from refs °”78 and |; for (10) from ref. © and for (11) from ref. °. 
The low-emissions scenario sees greenhouse gas mitigation adhered 
to, limiting global warming by 2070 to 0.9°C above the 1986-2005 
mean. The high-emissions scenario, in which no mitigation takes 
place, leads to 2.9°C of global warming by 2070 relative to the 1986- 
2005 mean, or 3.5°C relative to 1850-1900. The systems assessed 
are: (1) global average air temperature; (2) Antarctic contribution to 
global sea level; (3) Antarctic surface air temperature; (4) Southern 
Ocean surface temperature; (5) summer (February) sea ice extent; 
(6) Antarctic ice shelf volume; (7) ocean acidification (illustrated 

by a pteropod, a marine snail, with an aragonite shell subject to 
dissolution under acidic conditions); (8) level of alien species 
invasion; (9) ecosystem structure (under the low-emissions scenario 
the present ecosystem continues; under the high-emissions scenario 
some species, such as crabs, become established and other species 
shifts occur, such as from krill to salps, as the climate warms and sea 
ice retreats); (10) human presence; and (11) resource use. Each of 
these systems will continue to change after 2070, with the magnitude 
of the change to which we are committed being generally much larger 
than the change realized by 2070. 


already recorded from the peninsula in the early part of the century*'. 
Others are species that had long been predicted to colonize on the basis 
of analyses of seeds carried to Antarctica by scientific and tourist oper- 
ations®”. For other species, provenance remains obscure: they could 
have colonized naturally or through human agency. Debate of this con- 
tentious point at the Antarctic Treaty’s Committee for Environmental 
Protection (CEP) precluded action, resulting in their further spread. 
By 2070, many research stations and several new tourist hotels have, 
in consequence, developed manicured gardens. Settlements of perma- 
nent residents, including small numbers of migrants, have now become 
established to service the research and tourism industries®, and to con- 
trol invasive pests*. Invasive species management lessons learned in 
the late twentieth century from the sub-Antarctic islands seem to have 
been forgotten. 

Elsewhere on the continent, non-indigenous species have yet to 
establish populations®**. Yet, owing to discord in the CEP precipitated 
by rapidly changing conditions on the Antarctic Peninsula and many 
biotic invasions, attention to transfer of species among the continent's 
conservation biogeographic regions diminished***°. Molecular phy- 
logeographic studies of several groups, and especially the microbiota, 
were discontinued after investigations revealed that exchange of species 
and populations among Antarctic biogeographic regions had become 
virtually continuous after 2050°°. 
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Biodiversity conservation. Both in the Southern Ocean and on con- 
tinental Antarctica, action early in the twenty-first century led to 
improvements in biodiversity conservation, especially after attention 
was drawn to the potential for effective management to deliver rapid 
improvements in Antarctic biodiversity’. By 2050, however, many gains 
had started to suffer attrition. Establishment of protected areas slowed 
after an increase in the first two decades of the century, and protected 
area management failed to keep up with threats from growing human 
activity and resource exploitation®*’”. Moreover, funding of envi- 
ronmental protection to underpin decision-making declined owing 
to larger environmental degradation problems elsewhere, including 
those associated with rapidly rising sea levels**. Although the CEP con- 
tinues to meet on an annual basis, its recommendations are generally 
ignored at meetings of the parties to the Antarctic Treaty, unless they 
are relevant to the dominant issues of resource apportionment and 
management of expanding ice-free areas and widespread, land-based 
tourism. The special meeting held 50 years ago at the 40" Antarctic 
Treaty Consultative Meeting entitled ‘Our Antarctica: Protection and 
Utilization’ presaged this change. 


Change in human engagement with Antarctica 

Perceptions of the priorities for and efficacy of the Antarctic govern- 
ance regime and its global role varied widely among governments, 
NGOs and diverse members of civil society, and changed over time 
in concert with a rapidly changing global order. While the reaction 
of different actors to the growing evidence of dramatic change in the 
Antarctic environment therefore diversified, overall the past 50 years 
have been characterized by gradual erosion of the systems that safe- 
guarded the Antarctic environment in the late twentieth and early 
twenty-first century? 


Harvesting. Between 2017 and 2070, the global human population 
grew by >40%, from 7 billion to 10 billion®?. Owing to tremendous 
pressure for resources to support this larger and more prosperous pop- 
ulation, including a now overweight or obese majority”, Antarctica and 
the Southern Ocean are now even more widely explored or exploited. 
Most exploitation has occurred in the Southern Ocean, with a diversity 
of marine species harvested. At the CCAMLR, discussions now focus on 
resource apportionment among nations, rather than on an ecosystem- 
based approach to conservation®!. On the continent, harvesting is 
less noticeable. Ongoing legal battles over bioprospecting’ continue, 
however. Much of the pharmaceutical prospecting interest lies in 
discovering products that may manipulate human metabolism. 


Mining. In 2049, several nations attempted to rescind Article 7 of the 
Protocol on Environmental Protection to the Antarctic Treaty, which 
prohibits mineral resource exploitation. A majority of nations agreed, 
but the motion failed to obtain agreement of three-quarters of the 
states that were Antarctic Treaty Consultative Parties at the time of 
the adoption of the Environmental Protocol, a requirement of Article 
25. Thus, the Environmental Protocol remained unchanged. The 
early, clear division between the parties espousing a research for use 
approach and those more concerned with conservation’ largely disap- 
peared after 2048. Many nations started investigating resource potential 
and extraction technologies, thinly veiled under the guise of scientific 
exploration. Much of this ‘research’ activity now verges on extraction, 
with little political will to challenge these actions because of likely 
interference with geo-political complexities elsewhere on the planet®. 
Rapid technological developments in the Arctic have made resource 
exploration more affordable in polar regions, and global shortages of 
key minerals have driven investment in what has now come to be called 
‘scientific exploration. 


Tourism. Although a visit to Antarctica still remains the privilege ofa 
limited few compared with the global population, tourist numbers now 
exceed one million each year, reflecting rapid growth after an initial 
hiatus in the early twenty-first century”’. For many nations, tourism 
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revenue now provides the main source of funding for national Antarctic 
programmes, alongside partnerships with the fishing, pharmaceutical, 
food and minerals industries. Management of the Antarctic environ- 
ment is now similar to that of national parks elsewhere, where man- 
agers strike an uneasy balance between revenue generation, tourist 
numbers and biodiversity protection. Most of the global population 
now lives in cities and is less connected with the importance and mean- 
ing of the natural world (an extinction of experience”), resulting in loss 
of commitment to environmental concerns, including Antarctica. In 
consequence, programmes focused on a distant continent rather than 
on immediate surroundings attract little media attention, apart from 
the wide concern about Antarctica’s impact on global sea level rise. (A 
rearguard action to delay sea level rise by pumping seawater onto the 
Antarctic continent to be stored as ice, with power supplied by 850,000 
1.5-MW wind turbines, fell well short of the scale needed to make much 
difference to sea level rise™*). While the Antarctic Treaty System's var- 
ious agreements remain in place, they have weakened, leaving them 
vulnerable to other actors on a stage marked by regional rivalry rather 
than international cooperation. 


Antarctica in 2070 under low emissions 

Although the prospects for effective global action to mitigate emissions 
looked grim in 2015, the subsequent ratification of the United Nations 
Paris climate agreement by 196 countries, including the USA after some 
delay, heralded a new era of international cooperation to reduce green- 
house gas emissions. The faster-than-anticipated decrease in renewable 
energy and battery costs triggered a rapid transition out of coal. An 
increase in the magnitude and frequency of extreme climate events 
affecting major populations and economies highlighted widespread 
vulnerability and convinced decision-makers to increase their ambition 
to reduce greenhouse gas emissions, with the strong involvement of cit- 
ies, regions and business. As a result of these policies, amplifying carbon 
feedbacks were not triggered, and we are now on track to keep warming 
well below the 2° C target. New financial pathways helped create a 
functional and equitable carbon market, which incentivized business to 
transition rapidly to alow-carbon economy. Business leaders and fund 
managers began to appreciate the financial opportunities and other 
co-benefits of the transition associated with de-carbonization, and new 
technologies allowed for safe and efficient sequestration and ultimately 
removal of greenhouse gases from the atmosphere. As a result, radiative 
forcing has more or less followed the so-called RCP2.6 scenario con- 
sidered by the IPCC Fifth Assessment Report, with radiative forcing 
reaching a peak in about 2040 and with net fossil fuel emissions now 
negative'. Widespread recognition of the dangers of unrestricted use 
of fossil fuels inspired changes in consumption patterns in the devel- 
oped world, including shifts to more sustainable plant-based diets and 
changes in agriculture and land-use practices. The availability of low- 
cost renewable energy enabled developing countries to provide afforda- 
ble clean energy and alleviate poverty. Progress in meeting the challenge 
of climate change was accompanied by a renewed global commitment 
to the Sustainable Development Goals, most of which were achieved 
by 2035*°. Early action to reduce emissions allowed some costly adap- 
tation measures to be avoided (for example, the US$50 billion per year 
needed to protect coastal cities against flood losses®), freeing up funds 
for social goods such as improved healthcare and poverty reduction. 


Change in the physical environment 

The physical environment of Antarctica and the Southern Ocean 
remains similar in many respects to that of 50 years ago! (Fig. 2). 
Climate variability in Antarctica and the Southern Ocean continues 
to be dominated by the Southern Annular Mode, the high-latitude 
atmospheric response to ENSO”, and interactions between the two”. 


Atmosphere. Atmospheric trends observed in the decades before 2017 
were largely associated with changes in the Southern Annular Mode 
related to the ozone hole, in particular a southward shift and strength- 
ening of the westerly winds over the Southern Ocean, particularly in 
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summer’*. Decreases in emissions of ozone-depleting substances as a 
result of the 1989 Montreal Protocol led to gradual repair of the ozone 
hole and ozone levels in the Antarctic stratosphere have now returned 
to the values of the 1960s”*. Repair of the ozone hole and a stabilization 
of greenhouse gas concentrations in the atmosphere were accompanied 
by a gradual shift towards the Equator and weakening of the westerly 
winds in summer”, returning to values typical of the twentieth cen- 
tury’. Surface air and sea temperatures warmed by less than 1°C and 
precipitation slightly increased (<10%) over the ocean and interior of 
the Antarctic continent’. 


Southern Ocean and sea ice. Trends observed in temperature, salinity 
and circulation of the Southern Ocean in the late twentieth century 
and early twenty-first century slowed and ultimately reversed in the 
decades between 2020 and 2050. The return of the westerly winds to 
a position closer to the Equator was associated with a similar shift of 
the Antarctic Circumpolar Current and hence cooling in parts of the 
Southern Ocean. The overturning circulation continued to transfer 
anthropogenic heat and carbon dioxide effectively into the ocean inte- 
rior. Changes in wind-driven ocean currents reduced the exposure of 
the floating ice shelves to basal melt by warm ocean waters. However, 
the reduction in ocean heat transport to the ice shelf cavities came too 
late to save some West Antarctic ice shelves and ice tongues. Sea ice 
extent declined slightly (<15%) in both summer and winter between 
2015 and 2070’. 

Effective action to mitigate emissions has also slowed the rate of 
increase in acidity of the oceans. The pH of Southern Ocean surface 
waters stabilized in the 2040s at values about 0.15 below pre-industrial 
values, or 0.05 below values observed in 2015!. Southern Ocean surface 
waters remain super-saturated with respect to aragonite. The exposure 
of Southern Ocean biota to ocean acidification has therefore increased 
only marginally over the past 50 years. 


Ice shelves. While some ice shelves in the Antarctic Peninsula and 
Amundsen Sea were lost, the thinning rates observed in the large ice 
shelves for the period 1994-2012 remained fairly steady through to 
2070. The Totten, Amery and Larsen C ice shelves remain largely intact, 
undergoing normal retreat through several large iceberg calving events. 
The marine ice cliff instability**“, which glaciologists feared could 
become widespread by 2050, has mostly been limited to a few outlet 
glaciers in the Amundsen Sea sector of West Antarctica and has not 
reached East Antarctica. Persistence of pore space in the surface layer 
allowed more meltwater to be stored within the firn, decreasing the 
susceptibility to hydrofracture!™, 


Antarctic Ice Sheet and sea level. Although dynamic ice loss from 
marine-based sectors of the ice sheet has occurred, the rate of change 
is much less than the worst-case projections because many ice shelves 
continue to provide back-stress on the grounded ice**°. Mass loss from 
the Amundsen Sea sector of West Antarctica continued, contributing 
6cm of sea level rise between 2000 and 2070. After retreating steadily 
until 2050, the grounding zone of the Thwaites Glacier re-stabilized 
on a topographic feature about 25 km landwards of its early 
twenty-first-century position'®!, saving the West Antarctic Ice Sheet 
from further decay. In 2070, sea level rise continues to be dominated by 
contributions from ocean thermal expansion, glacier melt, and equal 
roles of the Greenland and west Antarctic ice sheets, as in the 2010s. 


Change in biology and ecosystems 

Value of Antarctica. Following the ratification of the United Nations 
Paris Climate Agreement, and the Santiago Declaration by the parties 
to the Antarctic Treaty!” to improve Antarctic and Southern Ocean 
environmental management, a sea change swept across the Antarctic 
Treaty System. The CCAMLR embraced the reality that climate change 
and harvesting were simultaneously threatening the Antarctic ecosys- 
tem. In consequence, barriers to establishment of Marine Protected 
Areas” were dismantled. Systematic conservation planning’, based 
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on recognition of evolutionary potential and genetic connectivity, ena- 
bled a flexible approach to maintenance of populations and helped 
achieve the CCAMLR goal of conservation of the Southern Ocean 
ecosystem. 


Marine ecosystems. Local change in the marine system continued 
along trajectories recorded early in the twenty-first century’*-7>”8, 
with reversals of earlier trends recently becoming apparent. Widely 
forecast tipping points were averted. New monitoring approaches!™*! 
provided opportunities to identify locations where such threshold shifts 
might be important and to design strategies to avert them, such as 
temporary ecosystem manipulation’. Although ocean acidification 
continued, the impacts stabilized following the decrease in atmos- 
pheric CO; levels after 2040. Some population declines were recorded 
in sensitive species”’, but others adapted, resulting in less change 
than was initially forecast”!. Seal and seabird populations continued 
to show changes in their foraging range, and changes in body mass 
and breeding success®”*+, However, stabilization of sea ice conditions, 
and a reduction in the frequency of the extreme events to which these 
species are sensitive!©’, reduced the rate of change. For some species, 
such as wandering albatross Diomedea exulans, weakening in the 
strength of the westerlies reversed a trend of enhanced breeding suc- 
cess and larger body mass associated with the strengthening westerlies 
earlier in the century’. 


Terrestrial ecosystems. On land, spread of the two indigenous vascular 
plant species on the Antarctic Peninsula continued, but at a declining 
rate by the late 2060s. Changes predicted for the Dry Valleys associated 
with pulse events’ continued, but with declining importance after 
2060. The Long Term Ecological Research sites proved exceptionally 
important as a source of evidence to document climate-associated bio- 
diversity responses. Rapid progress in remote sensing techniques!®, 
alongside adoption of a suite of essential biodiversity variables, enabled 
researchers to verify that changes in terrestrial ecosystems were within 
the bounds anticipated for the low-emissions scenario considered by 
the IPCC Fifth Assessment Report!. 


Invasive species. The introduction of invasive species along the 
Antarctic Peninsula initially continued as had been expected. However, 
none of the world’s most invasive species have established, largely 
because the climate remained inhospitable*’. Several European and 
sub- Antarctic species were able to gain a foothold and spread initially, 
as had been predicted from increasing human activity in the region®. 
Nonetheless, two developments reduced the magnitude of the problem. 
First, the CEP adopted a systematic, DNA barcoding and web-based 
surveillance system, which enabled rapid identification of ‘unusual’ spe- 
cies found by environmental managers, and appropriate action. Second, 
the parties to the Antarctic Treaty agreed two regulatory frameworks 
in quick succession in the years leading up to 2030. First, an agreement 
on the importance of Antarctic genetic resources and bioprospecting 
was put in place as a further Annex to the Environmental Protocol, thus 
resolving a decades-long impasse’. The renewed focus on the value of 
indigenous resources improved conservation actions to retain them, 
including the declaration of a suite of new terrestrial protected areas. 
Second, a Protocol on Tourism Regulation, initially proposed in the 
earlier years of the century° was eventually agreed as a further Annex to 
the Environmental Protocol. The Annex regulated all Antarctic activity, 
recognizing the similarity in impacts of science, tourism and resource 
extraction activities (the agreement excluded fishing). These develop- 
ments also prompted further attention to components of diversity not 
frequently considered, yet of tremendous consequence in the Antarctic 
terrestrial system—the microbiota'!!. The parties to the Antarctic 
Treaty agreed by resolution to apply strict biosecurity measures on 
travel among the Antarctic Conservation Biogeographic Regions®. 
Thus, evolutionary biology continued to document the unusual nature 
of Antarctic systems and their evolutionary history. The outcomes of 
this work provided valuable insights into the microbial history of the 
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planet and the way systems might evolve elsewhere. Moreover, biopros- 
pecting activities delivered new products that improved the health and 
wellbeing of human populations. 


Change in human engagement with Antarctica 

Effective governance. Strong action by the international community 
to mitigate greenhouse gas emissions was echoed in Antarctica, where 
the parties to the Antarctic Treaty reversed decades of regulatory inac- 
tion°. Decisive steps were taken to limit the impact of increased human 
engagement with Antarctica. Motivated by a clearer appreciation of 
the threats to the region and the global value of better understand- 
ing Antarctica and its links to lower latitudes, the parties reaffirmed 
the commitment to maintain Antarctica as a natural reserve for peace 
and science'” and, importantly, strengthened a variety of governance 
measures to ensure the commitment was put into practice. Although 
national implementation of measures remained variable, reflecting the 
complexity of interactions and feedbacks between international law 
and domestic legislation and politics® , the tenor of the international 
governance conversation had changed. 


International collaboration. Perhaps most important was the 
improved relationship between the Antarctic Treaty System and the 
United Nations, especially through its environment programme. The 
very cool relations sparked in the 1980s by the ‘Question of Antarctica, 
which remained on the United Nations’ General Assembly Agenda 
until 2005", started to thaw some two decades later. Chief among the 
catalysts was research showing that Antarctica and the Earth system 
are inseparable. Moreover, an increasing focus on truly global action 
to achieve the Sustainable Development Goals” helped improve col- 
laboration. Warming relationships between international institutions 
were also reflected by better collaboration between CCAMLR and 
other regional fisheries management organizations’, to the benefit of 
the latter. In consequence, regulatory activities across both marine and 
terrestrial environments improved and an integrated biodiversity strat- 
egy’ facilitated holistic management of the region. These achievements 
by the Antarctic Treaty System have provided a compelling illustration 
of the power of effective management of shared international spaces. 
The governance of Antarctica is now taught in schools worldwide as an 
example of successful international cooperation in conservation and 
sustainable resource management. 


Societal benefits. Human engagement in Antarctica continued to 
deliver societal benefits on a global scale. New compounds isolated 
from Antarctic biota are now used in a wide variety of industrial and 
medical applications (an advance made possible by the Environmental 
Protocol’s new Annex to ensure fair and effective management of 
intellectual property associated with biological prospecting). The 
skills of weather, climate and sea level forecasts for the entire Southern 
Hemisphere now depend heavily on the automatic observing systems 
established in Antarctica and the Southern Ocean. Lessons learned 
in redesigning Antarctic bases and logistics to increase efficiency 
and minimize environmental impact helped accelerate progress to a 
low-carbon global economy. These advances included improvements 
in building efficiency, renewable energy and storage, electric vehicles, 
waste management and autonomous systems. 


Conclusion 

We have described two retrospective narratives for Antarctica set 
in the year 2070: one in which greenhouse gas emissions continued 
to increase rapidly and little policy action was taken to respond to 
environmental and social factors affecting Antarctica, and a second 
in which strong action was taken to reduce emissions and to put in 
place effective policies to enhance the resilience of Antarctica. The two 
scenarios are of course highly speculative and intended as counterfac- 
tual catalysts for discussion, rather than as predictions of the future. 
One thing is certain, however: the narrative that eventually plays out 
will depend substantially on choices made over the next decade’. 
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Antarctica and the Southern Ocean are closely coupled to the rest 
of the globe. The Antarctic Ice Sheet is the largest and most uncertain 
potential contributor to future sea level rise. In addition, changes in 
high southern latitudes will also directly affect the energy budget of the 
Earth by altering the planetary albedo, the strength of the global over- 
turning circulation, the amount of carbon dioxide in the atmosphere, 
and the availability of nutrients to support marine life. 

Under the high-emissions scenario, Antarctica and the Southern 
Ocean undergo widespread and rapid change, with global conse- 
quences. But the environmental change realized by 2070 will be only 
a fraction of the change to which we are committed by choices made 
today, and the rate of change will have increased and continue to 
accelerate. For example, once initiated, the marine ice sheet instability 
will result in irreversible loss of large parts of the ice sheet resting on 
bedrock below sea level. Under the low-emissions scenario, in which 
global average temperatures remain within 2°C of 1850 values, there 
is some chance that the buttressing ice shelves will survive and the 
Antarctic contribution to sea level rise will remain below 1m. Under 
the high-emissions scenario, the ice shelves are lost and Antarctica con- 
tributes 0.6 m to 3 m of sea level rise by 2300, with an irreversible com- 
mitment® of 5m to 9m, or as much as** 15 m in the coming millennia. 

Despite the challenges, actions can be taken now that will slow the 
rate of environmental change, increase the resilience of Antarctica, and 
reduce the risk of out-of-control consequences. An effective response to 
the challenges of a changing Antarctica can serve as an example of the 
power of peaceful international collaboration, as well as demonstrate 
how integration of physical, biological and social sciences can enable 
decision-making that is informed by the past and takes account of the 
long-term consequences of today’s choices. 
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fungi, across European forests at a spatial scale and resolution that is—to our knowledge—unprecedented, to explore key 
biotic and abiotic predictors of ectomycorrhizal diversity and to identify dominant responses and thresholds for change 
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on ectomycorrhizal diversity, and define thresholds of community change for key variables. We quantify host specificity 
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host factors explain most of the variation in ectomycorrhizal diversity, that the environmental thresholds used as major 
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The main projected effects of environmental change on forest processes 
stem from global and regional perturbations in the carbon (C) and 
nitrogen (N) cycles!”, and declines in soil biodiversity*. Globally, myc- 
orrhizal mutualisms mediate soil processes in terrestrial ecosystems° 
and are major drivers of ecosystem carbon and nitrogen dynamics®. 
Soil carbon sequestration”, tree population dynamics’ and mitigation 
of CO; fertilization'® have recently been linked to ectomycorrhizal 
symbioses, which are ubiquitous drivers of photosynthetic carbon 
exchange for soil nutrients across temperate and boreal forests’’. 
How changes in ecosystem processes are underpinned by ectomyc- 
orrhizal fungi is poorly understood, but probable large-scale effects of 
those changes—for example, deteriorating tree mineral nutrition and 
health—are currently being observed!”’?. Various ecological processes 
are apparent only at large spatial scales’, and there is concern about 
the lack of baseline ectomycorrhizal distribution data against which to 
assess the effects of global change’>!®. Ectomycorrhizal research has 
emphasized laboratory or local-scale studies that are often reliant on a 
few species of culturable fungi to provide mechanistic understanding 
of symbiotic physiology. However, determinants of ectomycorrhizal 
diversity at local scales are not necessarily their primary drivers at larger 
scales’’, and ectomycorrhizal communities are often dominated by 


hardly culturable and non-fruiting or inconspicuously fruiting fungi'®. 
Furthermore, ectomycorrhizal community composition, richness, fine 
root biomass and morphology'®-”! and fungal aboveground fruiting” 
indicate different large-scale patterns and responses from plants and 
animals, and species richness increases with sample area more for 
ectomycorrhizal fungi than it does for microbes'””?. Consequently, 
there have been repeated calls for unbiased, large-scale, molecular 
and ecosystem-level baseline data on ectomycorrhizal fungi!>!8?%*, 
Elucidating large-scale ectomycorrhizal diversity is crucial for appro- 
priate experimental design in ecosystem science and model organism 
selection for experimental and comparative biology”. 

Unlike multiple local-scale studies in which ectomycorrhizal fungi 
have been found to be strongly determined by soil environment”*”’, 
recent large-scale biogeographical studies report that—other than 
host identity—soil, climate and atmospheric deposition explain only 
a limited amount of ectomycorrhizal variability*** (Supplementary 
Table 1). Even though specialists can be widespread, most ectomycor- 
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Fig. 1 | Map of Europe showing sampled level II plots from the United 
Nations Economic Commission for Europe International Co-operative 
Programme on Assessment and Monitoring of Air Pollution Effects on 
Forests (UNECE ICP Forests). Polygons depict outer boundaries of the 
sampled area for each host tree species. 


statistical analysis of large-scale standardized datasets or studies of low 
pollution sites!®?>°, Critical loads are essential tools for international 
atmospheric emissions control*”**, but for ectomycorrhizal fungi 
they differ markedly between Europe and North America®. In addi- 
tion, ectomycorrhizal physiological and morphological plasticity are 
thought to enhance soil nutrient uptake of trees across environmental 
gradients”; however, foraging-related functional traits are assumed to 
be fixed at species or genus levels. Wide gradients with abundant obser- 
vations are needed to link plasticity and environment. 

We conducted a detailed mycorrhizal analysis using one of the 
world’s largest and most intensive long-term monitoring networks of 
soil, atmospheric and vegetation parameters. We analysed 38 variables 
at 137 plots in 20 European countries across strong environmental gra- 
dients. We expected to (1) disentangle significant variability explained 
by co-varying climatic, soil and atmospheric deposition factors, 
(2) test the generality of host specificity, (3) detect precise thresholds of 
mycorrhizal change to inform environmental policy and (4) infer trait 
plasticity linked to key environmental gradients. 


Results 

We examined 29,664 ectomycorrhizas from 9,888 soil cores from 
103 plots of approximately 0.25 ha in 18 European countries. We also 
included data from 34 plots from two previous studies!®!8, which 
resulted in a dataset of 39,621 ectomycorrhizas from 137 plots in 20 
countries covering an area of approximately 5.5 million km? (Fig. 1). 
After removing short low-quality (12,038), chimaeric (231), non- 
mycorrhizal (848) and unknown (1,308) internal transcribed spacer 
DNA sequences, we retained 25,196 sequences that produced 1,406 
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ectomycorrhizal fungal operational taxonomic units (OTUs), of which 
82% were Basidiomycota and 18% were Ascomycota (Fig. 2); 914 OTUs 
were recorded more than once, and 90% of them were identified to 
genus or a higher taxonomic level. Of this 90%, 47% were identified 
to a species level. 


Composition and specificity 

We explained 38% of the variance in community composition with 
forward-selected variables according to the Akaike information 
criterion. Variables were divided into four partitions: host variables, 
soil + deposition, climate and geographical distance (Supplementary 
Table 2). Nine host variables together explained most of the overall 
community variance (23%), followed by soil + deposition (21%), 
geographical distance (14%) and climatic variables (12%). The 
partitions shared 20% of the overall explained variance (Fig. 3). 

We used global non-metric multidimensional scaling ordinations 
to visualize ectomycorrhizal fungal community composition, and 
we fitted environmental variables to the ordination to find the most 
influential variables (Extended Data Fig. 1 and Extended Data Table 1). 
Thus, we identified five key variables for subsequent analyses: nitrogen 
throughfall deposition (Nrgp), forest-floor pH, mean annual air 
temperature (MAT), potassium throughfall deposition (Krpp) and 
foliar nitrogen:phosphorus ratio (N:P,). 

Almost two-thirds (62%) of ectomycorrhizas correspond to fungi 
that produce aboveground mushroom-like fruitbodies; the rest produce 
inconspicuous truffles, crusts or sclerotia. Based on abundance, 48% of 
ectomycorrhizas were formed by generalists and 52% were specialists 
to coniferous or broad-leaf hosts. Only 7% of ectomycorrhizas were 
formed by specialists to one host tree species. Of the 88 OTUs that 
formed 50 or more ectomycorrhizas, 36 (41%) were generalists and 
52 (about 60%) were coniferous or broad-leaf specialists; 11 OTUs 
(12.5%) were specific to one host species. 


Indicators, thresholds and plasticity 

Threshold indicator species analyses identified decreasing (z—) 
and increasing (z+) indicator OTUs for all five key environmental 
variables (Fig. 4 and Extended Data Fig. 2). We identified environ- 
mental thresholds of ectomycorrhizal fungal community change by 
cumulating z— and z+ change points. For Nygp, we found a sum(z—) 
peak at 5.8 kg nitrogen per ha per year (kg N ha~! yr~!) and a sum(z+) 
peak at 15.5 kg Nha“ yr~’. For N:Pp, we detected peaks at 10.2 and 
13.3 for sum(z—) and sum(z+), respectively. We found a sum(z—) peak 
at 6.9 kg Kha™! yr~! and an indistinct sum(z+) peak at 21.7 kg K ha! 
yr! for Krpp. There was a distinct peak for forest-floor pH for both 
sum(z—) and sum(z-+) at 3.8. Indicator OTUs showed a clear thresh- 
old of change for MAT, with a 7.4 °C z— peak and a distinct 9.1 °C z+ 
peak. Most z— OTUs for Nrpgp, N:P potassium deposition, forest-floor 
pH and MAT were conifer specialists, whereas all z+ OTUs were 
generalists or broad-leaf associates. Generally, threshold values based 
on accumulated change points of individual taxa were less pronounced 
at the genus than at the OTU level (Extended Data Fig. 3). 

The observed frequencies of ectomycorrhizas with emanating 
hyphae and those with rhizomorphs differed significantly between 
tree species (P < 0.0001, d.f. = 3) and soil types (P < 0.0001, d.f. =5; 
Extended Data Table 2a, b); frequencies of ectomycorrhizas with ema- 
nating hyphae were higher than expected with beech and spruce, and in 
Fe-Al soils. Thirty of the 88 most-abundant OTUs (>50 ectomycor- 
rhizas) showed morphological plasticity and 26 of them were also indi- 
cators for a key environmental variable. The change in morphology of 
17 of these ectomycorrhizal taxa was significantly related to at least one 
environmental variable (Extended Data Tables 3a, 4a). Morphological 
plasticity related to at least one variable was found within 12 OTUs 
when a more stringent 99% sequence similarity was used (Extended 
Data Tables 3b, 4b). Intraspecific plasticity of individual indicator 
ectomycorrhizal fungi does not necessarily follow overall community 
morphological changes, for which logistic regressions showed that 
mean Np was positively related with hyphal presence (P < 0.0001). 
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There was a negative correlation between hyphal presence and 
forest-floor pH, N:P and Kypp, but no correlation with MAT (Extended 
Data Table 5). Community-wide, we found negative correlations 
between rhizomorph presence and all tested environmental variables 
(Extended Data Table 5). 


Discussion 

To our knowledge, this is the first large-scale high-resolution study of 
diversity and distribution of belowground tree symbionts that covers all 
major European climatic regions for the most abundant tree species. We 
explain considerable large-scale mycorrhizal diversity with an excep- 
tional range of high-quality environmental, host-related, climatic and 
geographical variables. We identify large-scale environmental predic- 
tors, show the dominance of host specificity, determine environmental 
indicators and new thresholds of change, and reveal morphological 
plasticity along environmental gradients. These findings serve as a 
baseline to assess future change and resilience. 

Host-related, soil and atmospheric deposition variables were the 
most important predictors of ectomycorrhizal community struc- 
ture across Europe. Four recent large-scale studies”*?!~*? found 
these variables to be minor predictors, although in local-scale stud- 
ies, soil environment shows strong effects”°?”. We distinguished five 
key environmental variables: Nygp, N:P, forest-floor pH, Krpp and 
MAT. Across previous large-scale studies, there is agreement that host 
species and soil pH are important, but results concerning other vari- 
ables disagree (Supplementary Table 1). Inconsistent large-scale driv- 
ers of diversity and abundance have previously been reported across 
different microbes”, but host is also fundamental for prokaryotes at 
macroecological scales*’. Environmental effects on ectomycorrhizal 
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Fig. 2 | Krona chart of taxonomic 
affiliation of ectomycorrhizas 
and their relative abundance. 
Inner circles represent higher 
taxonomic ranks, and more 
detailed taxonomic ranks (up to 
species level) are presented in 
outer circles. A full interactive 
version of this chart is available 

in Supplementary Fig. 1. Am., 
Amanitaceae; Bo., Boletaceae; 

Ca., Cantharellaceae; Cl., 
Clavulinaceae; Co., Cortinariaceae; 
Hyd., Hydnangiaceae; Hyg., 
Hygrophoraceae; Im., Imleria; In., 
Inocybaceae; Sc., Scleroderma; 
Scl., Sclerodermataceae; 

Th., Thelephoraceae; Tr., 
Tricholomataceae; Xe., 
Xerocomellus. 
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fungi in previous studies have probably been confounded by: (1) envi- 
ronmental variables from modelled or extrapolated regional sources; 
(2) non-standardized sampling and spatial pseudo-replication; (3) indirect 
assignment of mycorrhizal status and traits using databases (for exam- 
ple, UNITE (https://unite.ut.ee/), FunGuild (http://www.stbates.org/ 
guilds/app.php) or DEEMY (http://www.deemy.de/)); (4) semiquanti- 
tative analysis of short DNA sequences; and (5) pooling DNA samples 
from root hyphae, soil hyphae and dormant propagules even though 
ectomycorrhizal spore banks differ strongly from active communi- 
ties on roots at local and large scales**, and ephemeral aboveground 
reproductive structures and soil hyphae correspond weakly with active 
communities on roots**“. Asa result, up to 90% of variation in ecto- 
mycorrhizal diversity at large scales has remained unexplained by 
environmental models*’. The approach used here is considered more 
robust* and generates higher quality data*®, but had yet to be scaled 
up owing to technical challenges. The large unexplained part of com- 
munity structure may be attributed to factors that were not accounted 
for, such as disturbance, management history, stochasticity, interactions 
among variables masking individual effects, measurement and 
analytical errors, exclusion of rare species, seasonality, using taxonomic 
instead of functional diversity, and/or not covering complete gradients 
of each variable across whole geographical ranges of hosts and fungi. In 
our study, conifers have a larger distribution and thus cover larger envi- 
ronmental gradients; this difference in distribution probably explains 
the different number of environmental variables linked to community 
dissimilarities among hosts. 

Host-related variables strongly influence ectomycorrhizal fungal 
communities, thus symbiosis has a major role in shaping ectomycor- 
rhizal distributions. Studies on the host specificity of ectomycorrhizal 
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Fig. 3 | Variation-partitioning Venn diagram. This diagram shows the 
percentages of individual contributions of host variables (host species, 
foliar chemistry and defoliation), soil + deposition variables, climatic 
variables and geographical distance. The percentage of variance explained 
by multiple partition models is shown where ellipses overlap. Values 

in brackets show the total percentage of variance explained by the four 
partitions. Residual variance represents the percentage unexplained by the 
four partition models. 


fungi at large scales have mainly been based on fruitbody surveys and 
thus assess specificity on taxonomic rather than abundance levels*’. 
Host generalism is considered the rule*’, but intensive belowground 
analysis indicates ectomycorrhizal fungal specificity to the most- 
common European trees matches or exceeds generalism on taxonomic 
and relative abundance levels, particularly for conifers. We find more 
conifer specialists and they respond strongly to environmental gradients; 
the implications of specificity and abundance merit investigation as 
they can reflect, respectively, more**”” and less” efficient nutritional 
mutualisms. 

We use threshold indicator taxon analyses for the first time, to our 
knowledge, for fungi at a continental scale to identify distinct ectomy- 
corrhizal responses to key environmental variables and clear thresh- 
olds of change. Indicator species emerged for all key environmental 
variables, and several ectomycorrhizal taxa were indicators for more 
than one variable. Different fungi within a family, and even a genus, 
can be both positive and negative indicators for a variable; for instance, 
Thelephora terrestris and Tomentella castanea are negative and posi- 
tive indicators for N:P, respectively, and Lactarius rufus and Lactarius 
hepaticus are negative and positive indicators for Nrpp, respectively. 
Nonetheless, genus-level analyses revealed most indicator species 
patterns hold true at higher taxonomic ranks (Extended Data Fig. 3). In 
some genera, the aggregate of species acts as indicator, although indi- 
vidual species do not (for example, Sistotrema, Clavulina and Boletus for 
Nrep and Kygp). For several genera we find a different response to ele- 
vated Nrpp than previous studies, even those with consistent responses 
across studies”? (that is, Tomentella, Tylospora, Cenococcum, Hebeloma 
and Amanita). Furthermore, we confirm the response to elevated 
Nrrp of several genera that has previously only been recorded in a few 
studies*? (that is, Clavulina, Elaphomyces, Boletus and Amphinema). 

With increasing nitrogen availability, metabolically costly ways of 
obtaining nitrogen from complex soil organic sources are less cost- 
effective; fungi that use these pathways (for example, Cortinarius, 
Piloderma and Tricholoma) are at a disadvantage compared to fungi 
that use inorganic nitrogen (for example, Elaphomyces and Laccaria)*». 
Indeed, fungi that use organic nitrogen tended to be negative indicators 
for nitrogen deposition, and fungi that use inorganic nitrogen tended 
to be positive indicators. 

Some indicator species for Kypp are abundant and widespread in 
Europe (for example, Elaphomyces asperulus, Lactarius quietus and 
Piloderma sphaerosporum); however, Kypp has not been identified as 
a key variable in previous ectomycorrhizal studies. A meta-analysis 
showed that in 69% of experiments tree growth responded posi- 
tively to soil potassium increases*', but potassium is highly diffusible 
in soil and easily accessible for plants. Some Krpp may originate 
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Fig. 4 | Threshold indicator taxa analyses. a, Individual OTU abundances 
in response to Nygp. Black symbols show taxa declining with increasing 
Nrep (z—), open symbols depict increasing taxa (z+). Symbol size 

is proportional to magnitude of response (z-score). Horizontal lines 
represent 5th and 95th quantiles of values resulting in the largest change in 
taxon z-scores among 1,000 bootstrap replicates. Tree shapes indicate host 
generalist, conifer- or broad-leaf-specific. b, Community-level output of 
accumulated z-scores per plot is shown in response to Nygp. 


from canopy leaching; with acidifying pollution, potassium leaches 
and—if depleted in foliage and litter—potassium availability in 
soil organic matter could decrease. Moreover, potassium is taken 
up and translocated by ectomycorrhizal fungi in a specific manner 
(for example, ectomycorrhizal fungi with hydrophobins transfer 
less potassium)!!. This is consistent with our results; most negative 
indicator genera were hydrophobic and most positive indicator genera 
were hydrophilic*”. 

Based on the large number of indicator species for MAT, climate 
should have an important role in shaping ectomycorrhizal communities, 
as suggested by fruiting phenology studies**. However, it is difficult to 
distinguish MAT from climate and therefore to know whether a fungus 
occurs somewhere because of prevalent temperatures. Nevertheless, 
current habitats may become less favourable for many ectomycorrhizal 
fungi as temperature increases. 
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Accumulated change-point values of all individual ectomycorrhizal 
fungi indicate environmental thresholds of change for most key envi- 
ronmental variables. There was a narrow range for fungi negatively 
affected by Nrgp with a sharp threshold at 5.8 kg N ha7! yr7!. These 
mainly conifer specialists thrive in poor soils and pre-industrial nitro- 
gen levels (approximately <2 kg N ha7! yr7'), but cannot keep up with 
increased Nyxp from industrial, agricultural and transport emissions 
over the past decades. They are probably outcompeted by fungi that use 
the additional inorganic nitrogen or avoid additional nitrogen uptake 
costs™, particularly within the temperate distribution ranges of beech 
and oak where N rrp is greatest, and organic nitrogen users show some 
recovery in fruiting if nitrogen pollution decreases”. Positively affected 
fungi—which are mostly host generalists that lack proteolytic abilities— 
initially do well with additional inorganic N, giving them a competitive 
advantage. However, the much broader response range and less defined 
peak at 15.5 kg N ha! yr7 for these fungi suggests that adaptation by 
positively affected fungi to increased Nrpp varies greatly. This might be 
driven by geographically divergent population-level evolutionary selec- 
tion pressures on fungi since the Industrial Revolution. Furthermore, 
naturally enriched microsites (for example, animal latrines, carcasses 
and disturbances) and macrosites (for example, stands with N) fixers) 
could have pre-adapted certain fungi to high post-industrial levels of 
nitrogen. 

We confirm and extend observations based on fruitbodies and roots 
at smaller scales” that conifer specialists—particularly those with 
abundant hyphae and rhizomorphs—are more negatively affected 
by increasing nitrogen than generalists and broad-leaf specialists. 
The strong differences observed in host specificity between fungi 
negatively and positively affected by Nygp may be caused by differ- 
ences in their enzymatic capability to acquire nitrogen directly from 
complex soil organic compounds (thus circumventing mineralization) 
and in resource exchange rate, for example, if specialists transfer more 
soil nitrogen per unit of tree carbon than generalists**. Comparative 
genetic, physiological and ecological studies of the different sets of 
dominant indicators are now needed to test alternative models of 
ectomycorrhizal community optimization versus parasitism under 
changing carbon and nitrogen conditions” through species replacement, 
plasticity and/or evolution™. 

Large-scale belowground analysis contributes important information 
on ecosystem assessment tools for a uniquely important guild of forest 
organisms. Critical loads for eutrophying nitrogen deposition were 
previously estimated for ectomycorrhizal fungi at 5-10 kg Nha™! yr~* 
for North America® and 10-20 kg N ha™! yr7! for Europe”, largely 
based on expert opinion and aboveground data. Thresholds based 
on European ectomycorrhizal data have focused on a few sites across 
smaller gradients, or ectomycorrhizal richness and evenness instead 
of community composition!®**>, Our large nitrogen-deposition gradi- 
ent leads to a much lower European threshold value for a substantial 
ectomycorrhizal shift at 5-6 kg N ha’ yr~’, based on both throughfall 
and open field deposition data, approaching recent lower estimates for 
other forest organisms™*!. Caution is needed when inferring absolute 
values for critical loads, but based on our results the critical loads for 
European forests require adjustment towards those for North American 
forests, and ectomycorrhizal and forest change thresholds need aligning 
to explain the alarming deterioration in European tree nutrition)’. 
Critical N:Py are considered to be plant specific®? and N:P; has been 
linked to tree health, with breakpoint values of 7.3 for conifers and 
14.8 for broad-leaf trees with respect to defoliation’”. We show that 
lower (10.2) and upper (13.3) N:Pp thresholds for ectomycorrhizal 
communities are linked to conifers and broad-leaf trees, respectively. 
Community threshold forest-floor pH levels for negative and positive 
indicator species overlap. Although soil pH is anthropogenically influ- 
enced (for example, through liming) and soil acidification affects parts 
of Europe®, the major soil pH differences across forests arise from 
soil parent material and climatic differences over long timescales, and 
must have long been an influence on ectomycorrhizal communities. 
Nonetheless, individual species could be affected by changing pH levels. 
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For Kp, no threshold values for ectomycorrhizal composition have yet 
been published. We identify a 5-8 kg K ha~' yr~’ threshold for declin- 
ing species; however, Kypp results partly from potassium uptake and 
leaching by trees, which may be influenced by ectomycorrhizal fungi 
themselves. Therefore, research into potassium deposition and cycling 
is needed for ectomycorrhizal communities"! and forests’. 

Physiological and morphological heterogeneity and plasticity of 
ectomycorrhizal mycelium have previously been considered to be 
responsible for enabling trees to rapidly take up soil nutrients®>®; here 
we show morphological plasticity within dominant ectomycorrhizal 
taxa and changes over environmental gradients. This has important 
implications for functional diversity studies at large scales and/or across 
gradients. Indirect assignment of ectomycorrhizal functional traits to 
taxonomic groups merits caution and temporal variation of these traits 
merits investigation. 

We conclude that intensive and extensive organismal and environ- 
mental data collection, with multiple biotic and abiotic co-varying 
factors, reveals that soil, atmospheric deposition and climate variables 
control large-scale patterns of species distributions in ectomycorrhizal 
communities. Such data enable the linking of species and community 
responses to environmental thresholds acting across macroecological 
scales and deliver new insights into spatial variation in specificity and 
functional trait plasticity below the ground. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
018-0189-9. 
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METHODS 


No statistical methods were used to predetermine sample size. 

Sampling and processing. Since 1995, the International Co-operative Programme 
on Assessment and Monitoring of Air Pollution Effects on Forests (http://icp- 
forests.net/)® has been intensively monitoring about 800 plots (level II) in major 
forest ecosystems across Europe”. Their extensive in situ data better reflect the 
local environmental conditions of plots than regional modelled or extrapolated 
data®. These level II plots of at least 0.25 ha and located within homogenous forest 
stands are structurally diverse and cover a representative mixture of European 
managed forest types (ranging from plantations to natural regenerating forests)”. 
European forests are dominated by Scots pine, Norway spruce and European beech 
(60% of the European Union forest area), with the next three most-common tree 
species together covering 10%. We selected all ICP Forests level II plots in which 
deposition, meteorology, foliar chemistry, soil and preferably soil solution data are 
measured simultaneously, and between September 2013 and September 2015 we 
sampled plots with European beech (Fagus sylvatica L.; n = 35), Norway spruce 
(Picea abies (L.) H. Karst; n = 36) or Scots pine (Pinus sylvestris L.; n = 32) as the 
dominant (>50% abundance) tree species. We combined these with previously 
published additional data that were similarly collected from Scots pine level II 
plots!’ (n = 12) and pedunculate and sessile oak (Quercus robur L. and Quercus 
petraea (Matt.) Liebl)!* (n = 22), to give a widespread coverage of European forest 
areas (Fig. 1). 

We used Sanger DNA sequencing of the full internal transcribed spacer (ITS) 
amplicon from individual ectomycorrhizas to maximize resolution of identifi- 
cations, obtain relative abundance data and link DNA sequences directly to 
morphology, following previously published standardized sampling protocols!*!8, 
In brief, on each plot (n = 137) 24 trees of the investigated target tree species were 
randomly selected and from those trees a transect was made to the nearest tree 
of the target species, then four soil samples (25-cm deep, 2-cm diameter) were 
collected at equal distances on each transect. When plots contained multiple tree 
species, areas with non-target tree species were avoided. Soil samples were stored 
at 4 °C for up to ten days until they were processed. Roots from each soil core were 
rinsed on a 0.5-mm sieve, and mycorrhizal roots were collected for five minutes 
using a dissecting microscope. Subsequently, from each soil sample, an individual 
mycorrhiza was sampled from the three longest roots, resulting in 288 mycorrhizas 
per plot. Morphological characteristics of each mycorrhiza were recorded, includ- 
ing presence/absence of emanating hyphae and rhizomorphs, and turgor to assess 
activity. Genomic DNA from individual mycorrhizas was obtained using Extract- 
N-Amp (Sigma-Aldrich), and the ITS region of the nuclear rDNA was amplified 
using ITS1F”° and ITS4”! primers. Amplicons were purified using ExoSAP-IT 
(USB) and sequenced bidirectionally using BigDye3.1 with an ABI 3730 DNA 
Analyzer (Applied Biosystems). 

Environmental data. On the level IJ plots various environmental long-term 
measurements (average 14 years) were carried out using national protocols based 
ona harmonized methodology” (see Supplementary Table 2). Soil types were 
classified in ten types: Andosols, Arenosols, Calcisols, Cambisols, Leptosols, 
Podzols, Regosols, Umbrisols, soil types characterized by an Argic B horizon (that 
is, Luvisols and Alisols), and soils with gleyic properties (that is, Gleysols and 
Stagnosols)®”*. While maximizing the number of plots without missing values 
(n = 108), we selected available data—including forest age, level of defoliation”’, 
geographical coordinates and elevation—along with soil (eight variables) and foliar 
(seven variables of investigated tree species)’* data, atmospheric throughfall depo- 
sition chemistry (wet and dry under forest canopy deposition, eleven variables)” 
and meteorology (six variables)””. 

Bioinformatics. We used Phred’* to obtain base quality scores (Q) for both 
forward and reverse DNA sequences from all individual mycorrhizas, including 
previously published DNA sequences!©®. The two sequences obtained from 
each mycorrhiza were assembled in Geneious (version 8.1.8)’?, with the De novo 
Assemble tool. We used Trimmomatic® to remove low-quality bases (Q < 20) 
at either end of the sequences and then discarded short reads (<100 remaining 
bp). We then used the uchime_ref tool in vsearch*! to match chimaeric sequences 
against the UNITE reference database (version 7.1, 22/08/2016). 

We used the usearch_global tool in vsearch to identify remaining DNA 
sequences with a percentage match > 97% to UNITE 7.1 species hypotheses®”. 
From the remaining unmatched sequences, we first removed all sequences with 
ambiguous base pair codes and then used the cluster_fast tool in vsearch, to identify 
de novo OTU clusters. The unmatched sequences were then matched to the 
centroids of these de novo clusters; sequences were accepted with a percentage 
identity > 97% and the remaining sequences were discarded. 

We used three sources of information for each de novo centroid to confirm 
the identification of the fungal sequences and to provide tentative classifications. 
First, we examined the ten best alignments from BLAST searches*? of the GenBank 
nucleotide database. Second, we trained RDP Classifier®* against the UNITE 7.1 
database and then classified the de novo centroids against the trained database. 


ARTICLE 


Third, we used vsearch to obtain the best match of each centroid to the UNITE 
7.1 species hypotheses. 

Finally, we checked the ectomycorrhizal status of all OTUs by comparing the 
taxonomic classification based on UNITE with the literature***°, When OTUs 
assigned in UNITE to a species hypothesis were identified to a taxonomic level 
that includes both ectomycorrhizal and non-ectomycorrhizal fungi (for example, 
Agaricomycetes species), we retrieved the taxonomic names associated with all 
UNITE DNA sequences within that species hypothesis to assess the level of uncer- 
tainty in the classification of the species hypothesis. We discarded de novo OTUs 
with less-resolved classification: (1) those with a classification that was distant from 
known ectomycorrhizal fungi, (2) those where the root tip morphology suggested 
possibly dead plant or fungal tissue, and (3) those which were based on relatively 
short sequences (<150 bp). The set of identified ectomycorrhizal fungal sequences 
was then used to construct an abundance matrix of OTUs across sites. We used the 
Hellinger transformation of proportion abundance*” in subsequent analyses. Host 
specificity of abundant OTUs (>50 ectomycorrhizas) was established by scoring 
occurrence at plots with the different tree hosts. The OTUs occurring with one 
host tree species in a plot were considered strictly specific and OTUs occurring 
with both coniferous and broad-leaf species, or with more than two tree species 
were considered generalists. 

Statistical analysis. We used R (version 3.3.3) for statistical analyses and gener- 
ating figures*®, 

To quantify the importance of host variables, soil and deposition chemistry, 
climate and geographical distance on ectomycorrhizal fungal community compo- 
sition, variances were partitioned following previous publications*””°. Explanatory 
variables describing plot and tree characteristics were grouped in the following 
partitions: (1) host (host species, foliar chemistry and defoliation), (2) soil and 
deposition chemistry (soil characteristics and throughfall deposition), (3) climate 
(climatic region, MAT, precipitation, growing season length, minimum and maxi- 
mum annual temperatures, and elevation) and (4) geographical distance (excluding 
elevation). The most-relevant variables in each partition were found through for- 
ward-selection model-building with the redundancy analysis method based on the 
Akaike information criterion and P < 0.05 using ordistep in the vegan package”!. 
Geographical distances are the great circle distances, calculated using the mean 
Earth radius between the minimum and maximum latitude of plots in this study 
(r = 6,365 km) with rdist.earth in the fields package”. Great circle distances are 
commonly used in large-scale macroecological studies to approach real distances 
between sampling sites’***, The geographical distance matrix was transformed to 
rectangular data by extracting spatial vectors with principal coordinates of neigh- 
bour matrices (PCNM) using pcnm (vegan). To build the geographical distance 
model, PCNM vectors accounting for autocorrelation were extracted (P < 0.05) 
using Moran (Ictools package)”® and forward selected. Variation partitioning was 
carried out for the 108 plots with the selected environmental data using varpart 
(vegan). 

Global non-metric multi-dimensional scaling ordinations were used to explore 
and visualize the main factors that affect ectomycorrhizal fungal community 
composition with metaMDS (vegan). Environmental variables (Supplementary 
Table 2) were fitted to the ordination plots using envfit (vegan). Ordinations 
were performed for the 108 plots with the selected environmental data. To limit 
co-linearity effects between variables, we selected key environmental variables 
from the envfit results with R? > 0.4 and P < 0.01. In case of correlations (r > 0.7) 
between those variables, the most-commonly measured environmental variable 
(Supplementary Table 1) was selected: Ngp, forest-floor pH, MAT, Krep and N:P». 

Indicator species for the key environmental variables were detected and their 
threshold values were calculated using threshold indicator species analyses 
(TITAN2)*°. The sums of the indicator species scores of all OTUs were used to 
detect lower and upper ectomycorrhizal community thresholds for key environ- 
mental variables. In addition to Nrgp, we also obtained ectomycorrhizal commu- 
nity thresholds for nitrogen open field deposition, because open field deposition 
measurements better reflect the data that is available in spatially mapped deposition 
datasets”, 

G-tests were performed to test whether host species or soil type influence hyphal 
and rhizomorph presence or absence. We used logistic regression with each key 
environmental variable and the presence or absence of emanating hyphae and 
rhizomorphs within individual OTUs to test for environmental influences on their 
morphological plasticity. We considered OTUs for which the indicator analysis 
suggested a response to a particular environmental variable and, for statistical 
power, we only tested OTUs with > 15% presence and > 15% absence of emanating 
hyphae or rhizomorphs (Extended Data Table 1). Target tree species and soil type 
were used as co-variates, to account for potential variation in hyphal and rhizo- 
morph development in mycorrhizas belonging to the same OTU among different 
tree species and different soil types. 

Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 
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Code availability. R scripts for data analyses are available from the corresponding 
author upon reasonable request. 

Data availability. Sequencing data generated during the current study are available 
through DRYAD (https://datadryad.org/) with the DOI https://doi.org/10.5061/ 
dryad.cr70qc8. Morphological characteristic and host specificity data are available 
from the corresponding author upon reasonable request. All environmental data 
(including deposition, foliar chemistry, soil and meteorological data) are available 
from UNECE ICP Forests (http://icp-forests.net/page/data-requests). Restrictions 
apply to the availability of these data, which were used under license for the current 
study. Data are available from the corresponding author upon reasonable request 
and with permission of UNECE ICP Forests. 
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Extended Data Fig. 1 | Global non-metric multidimensional scaling show the direction and strength of correlation of the most-influential 
ordination of community composition. Plots shown with host trees: environmental variables according to their R? values (>0.4). A, MAT; 
brown squares, beech; blue circles, oak; green triangles, pine; yellow B, mean minimum annual air temperature; C, growing season length; 
diamonds, spruce. Isoclines depict the forest-floor pH and arrows D, NH, throughfall deposition; E, Nrpp. 
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Extended Data Fig. 2 | Threshold indicator taxa analyses. 

a, c, e, g, Analyses of individual OTU abundances in response to N:P 
(a), forest-floor pH (c), Kypp (e) and MAT (g). Black symbols correspond 
to taxa declining with the increasing variable (z—) and open symbols 
depict increasing taxa (z+). Symbol size is proportional to magnitude 

of response (z-score). Horizontal lines represent 5th and 95th quantiles 


of values resulting in the largest change in taxon z-scores among 1,000 
bootstrap replicates. Tree shapes indicate host generalist, conifer- or 
broad-leaf-specific. b, d, f, h, Community-level output of accumulated 
z-scores per plot is shown in response to N:Py (b), forest-floor pH 

(d), Kren (f) and MAT (h). 
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Extended Data Fig. 3 | Threshold indicator taxa analysis at the genus 
level. a, c, e, g, i, Analyses in response to Np (a), N:Pr (c), forest-floor 
pH (e), Krep (g) and MAT (i). Black symbols correspond to taxa that 
declined with the increasing variable (z—) and open symbols depict 
increasing taxa (z+). Symbol size is proportional to magnitude of response 


resulting in the largest change in taxon z-scores among 1,000 bootstrap 
replicates. b, d, f, h, j, The community-level output of the accumulated 
z-scores per plot is shown in response to Nrgp (b), N:Pr (d), forest-floor 
PH (f), Krep (h) and MAT (j). 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


Extended Data Table 1 | Envfit results for the environmental variables used in the non-metric multi-dimensional scaling ordination 


NMDS1 NMDS2 R? P-value 

VECTORS: 

Latitude 0.95129 0.3083 0.2462 0.001 
Longitude 0.22325 0.97476 0.3762 0.001 
Defoliation -0.56313  -0.82637 0.0178 0.379 
Foliar N -0.71074  -0.70345 0.5719 0.001 
Foliar K -0.77982  -0.626 0.3201 0.001 
Dep tf pH -0.98433  -0.17634 0.3938 0.001 
MAT -0.48638  -0.87375 0.4263 0.001 


Min annual air temperature -0.43426 -0.90079 0.4749 0.001 
Max annual air temperature -0.35748 -0.93392 0.1344 0.002 


Growth season length -0.38123 -0.92448 0.4845 0.001 
Soil pH CaCl. -0.37544 0.92685 0.322 0.001 
Forest floor pH CaClz -0.85397 0.52032 0.4833 0.001 
Forest floor total organic C 0.17461 0.98464 0.02 0.366 
Forest floor total organic N= -0.9989 -0.04683 0.1181 0.003 
Elevation -0.40274 0.91532 0.2059 0.001 
Foliar P 0.48318 0.87552 0.0265 0.234 
Foliar N:P ratio 0.67447 0.7383 0.4888 0.001 
Foliar Ca -0.86227 0.50645 0.4524 0.001 
Foliar Mg 0.89284 -0.45037 0.3697 0.001 
Foliar S -0.80043 -0.59943 0.511 0.001 
Dep tf conductivity -0.31098 -0.95042 0.3403 0.001 
Dep tf K -0.79468 -0.60703 0.4463 0.001 
Dep tf Ca -0.99963 -0.02728 0.165 0.001 
Dep tf Mg -0.63371 -0.77357 0.1937 0.001 
Dep tf Na 0.27266 0.96211 0.3241 0.001 
Dep tf NH, -0.29206 -0.9564 0.4298 0.001 
Dep tf Cl 0.26542 0.96413 0.3687 0.001 
Dep tf NO; -0.36813 -0.92977 0.2465 0.001 
Dep tf SO, 0.29501 0.95549 0.2802 0.001 
Dep tf N total -0.30962 -0.95086 0.4363 0.001 
Mean annual precipitation 0.98576 0.16817 0.0484 0.074 
Soil total organic C -0.46567 -0.88496 0.0507 0.07 
Soil total organic N -0.87811 -0.47846 0.0963 0.007 
Forest floor dry mass 0.78238 -0.62281 0.1256 0.001 
Mean tree age -0.93483 0.35511 0.1855 0.001 
FACTORS: 

Tree species 0.5262 0.001 
Soil type 0.2556 0.001 


Climatic region 0.3125 0.001 


Significant variables are shown in bold. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


Extended Data Table 2 | Observed and expected frequencies of hyphae and rhizomorph presence 


a 
Hyphal presence Rhizomorph presence 
beech pine spruce beech pine spruce 
Observed 4451 3767 4709 195 443 258 
Expected 4525 3777 4636 313 262 321 
b 
Hyphal presence Rhizomorph presence 


S1 S2 $3. S4 S5_ S6 S1  S2 S3 S4 S5 S6 
Observed 4412 1067 6064 232 622 490 363 47 373 42 31 40 
Expected 3978 1159 6204 258 644 644 277 81 431 18 45 45 


a, Hyphal and rhizomorph presence per host tree species. b, Hyphal and rhizomorph presence per soil type. S1, Fe-Al soils; S2, clay soils; S3, soils with little or no differentiation; 
S4, salt accumulation soils; S5, organic accumulation soils; S6, limited root soil. 
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Extended Data Table 3 | Effects of key variables on hyphal plasticity 


a 
Forest 

OTU n Ntrp N:Pe floor pH MAT Ktrp 
Amphinema byssoides 1 129 0.06 0.19° 
Atheliaceae sp 3 72 0.61 0.35" 
Cortinarius caperatus 55 0.64 ¥ <0.05 0.061" 0.061" 
Elaphomyces asperulus 347 0.52 0.24 0.62 0.16 0.27 
Elaphomyces granulatus 519 4 <.0001" 1 <0.001° 
Elaphomyces muricatus 1 438 0.14° 0.57" 
Imleria badia 115 0:32" 0.74 
Laccaria sp 7 138 0.117 0.677 4 <0.05" 0.94" 0.8" 
Lactarius blennius 159 0.94* 0.25° 
Lactarius hepaticus 98  <0.01" ¥ <0.01 =} <.0001° 
Lactarius subdulcis J <0.05° 1 <0.01° 0.67" 
Meliniomyces bicolor 108 0.38" 0.59" 0.96 0.39" 
Piloderma bicolor 102 0.27 0.19° 0.19" 0.75 0.23" 
Piloderma sp 7 56 4 <0.05° 0.054 0.23" <0.01° <0.01° 
P. sphaerosporum 1051 4 <.0001° H <.0001° 4 <.001 <.0001° <.0001° 
Russula cyanoxantha 1 78 0.086" <0.05" 0.28" 0.26" 0.09° 
Russula ochroleuca 834 <0.01" 0.12 0.96" 
Russula sp 14 225 0.078" J <.001" 0.12*° + <0.05" 
Thelephora terrestris 64 0.74 0.78 
Tomentella albomarginata 100 0.17 0.075 0.47* 
Tomentella castanea 78 0.093° 0.22° 0.064" 0.34° 
Tylospora asterophora 355 $<0.001 4<0.05 0.62  <0.05 
Tylospora fibrillosa 555 0.23" 0.28 0.82 
Xerocomellus pruinatus 106 0.28" 0.51" 0.08" 0.23" 0.48" 
b 

Forest 

OTU n Ntep N:P- floor pH MAT Ktep 
Amphinema byssoides 1 119 0.086" 0.17 
Atheliaceae sp 3 72 0.66 0.22 
Elaphomyces asperulus 332 0.16 4 <0.05° 0.94 4 <0.01° 0.20 
Elaphomyces granulatus 538 4 <.0001° : t <.0001° 
Elaphomyces muricatus 1 376 0.33 0.19 
Imleria badia 86 0.22* 0.89" 
Lactarius blennius 132 0.75" 0.71" 
Lactarius hepaticus 86 4 <0.01" ¥ <0.05 4 <0.001" 
Lactarius subdulcis 241 <0.05" <0.01° 0.75° 0.35° 0.94° 
Piloderma sphaerosporum 869 I <.0001 i <.0001° 4 <0.01° 4 <.0001° 4 <.0001° 
Russula cyanoxantha 1 72 0.48" 0.054* 0.059" 0.217 0.06" 
Russula ochroleuca 861 4 <0.05° 0.079" 0.86" 
Russula sp 14 116 0.87" 4 <0.01" 0.84" 0.064* 
Thelephora terrestris 54 0.86 0.94 
Tomentella castanea 76 ¥ <0.05" 0.076" 0.2" 0.55" 
Tylospora asterophora 325 4 <0.01° 0.081" 0.64 0.21 
Tylospora fibrillosa 71 4<0.05° 0.27 0.23" 
Xerocomellus pruinatus 63 0.32° 0.11° 0.58" Orr 0.37" 


a, OTUs with 97% sequence similarity. b, OTUs with 99% sequence similarity. Values with P < 0.05 are shown in bold. Logistic regressions were calculated only for OTUs for which the indicator 
analysis suggested a response to a particular environmental variable. Values with superscript — refer to a declining indicator (z—); values with superscript + refer to an increasing indicator (z+). 
A downward-pointing arrow denotes a negative correlation; an upward-pointing arrow denotes a positive correlation. 
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Extended Data Table 4 | Effects of key variables on rhizomorph plasticity 


a Forest 
floor 

OTU n Ntrp N:Pr pH MAT Ktep 
Amphinema byssoides 129 0.22 0.42 
Imleria badia 115 0.87° 
Piloderma bicolor 102 0.28 0.89 0.25 0.25 0.75 
P. olivaceum 409 ¥<0.01' 0.32 0.45 0.24 0.24 
P. sp 36 58 0.073 ¥<0.05 0.17 ¥<0.05° 0.17 
P. sphaerosporum 1051 0.21 0.76 %<0.05° 0.27 0.94 
Xerocomellus pruinatus 106 +<0.01" 0.117 4<0.05" 0.60" 4<0.01" 
b Forest 

floor 

OTU n Ntep N:PF pH MAT Ktep 
Amphinema byssoides 119 0.29" 0.28 
Imleria badia 86 0.40° 
P. olivaceum 126 +¥<0.05 0.64 0.94 0.0517 0.39 
P. sp 36 58 0.095" 0.093 0.20 ¥<0.05° 0.24 
P. sphaerosporum 869 0.21 0.54 0.0817 0.46 0.83 
Xerocomellus pruinatus 63 0.058" 0.31* ¥<0.05" 0.77° J4<0.05* 


a, OTUs with 97% sequence similarity. b, OTUs with 99% sequence similarity. Values with P <0.05 are shown in bold. Logistic regressions were calculated only for OTUs for which the indicator 
analysis suggested a response to a particular environmental variable. Values with superscript — refer to a declining indicator (z—); values with superscript + refer to an increasing indicator (z+). 
A downward-pointing arrow denotes a negative correlation; an upward-pointing arrow denotes a positive correlation. 
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Extended Data Table 5 | Effects of key variables on hyphal and rhizomorph presence on the total ectomycorrhizal community 


Forest 
Ntrp N:P- floor pH MAT Ktrp 
Hyphae 4<.0001  }<.0001 4 <.0001 0.26 ¥ <0.05 
Rhizomorphs 4<.0001 4<.0001 4 <0.001 4<.0001 4<.0001 


Values with P < 0.05 are shown in bold. Values with superscript — refer to a declining indicator (z—); values with superscript + refer to an increasing indicator (z+). A downward-pointing arrow denotes 
a negative correlation; an upward-pointing arrow denotes a positive correlation. 
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Visualizing late states of human 40S 
ribosomal subunit maturation 


Michael Ameismeier?, Jingdong Cheng!?, Otto Berninghausen! & Roland Beckmann!* 


The formation of eukaryotic ribosomal subunits extends from the nucleolus to the cytoplasm and entails hundreds of 
assembly factors. Despite differences in the pathways of ribosome formation, high-resolution structural information 
has been available only from fungi. Here we present cryo-electron microscopy structures of late-stage human 40S 
assembly intermediates, representing one state reconstituted in vitro and five native states that range from nuclear to 
late cytoplasmic. The earliest particles reveal the position of the biogenesis factor RRP12 and distinct immature rRNA 
conformations that accompany the formation of the 40S subunit head. Molecular models of the late-acting assembly 
factors TSR1, RIOK1, RIOK2, ENP1, LTV1, PNO1 and NOBI provide mechanistic details that underlie their contribution 
to a sequential 40S subunit assembly. The NOB1 architecture displays an inactive nuclease conformation that requires 
rearrangement of the PNO1-bound 3’ rRNA, thereby coordinating the final rRNA folding steps with site 3 cleavage. 


Ribosomes are universally conserved macromolecular complexes that 
translate mRNA into protein. In eukaryotes, they consist of a small 40S 
and a large 60S subunit that together comprise four rRNAs and about 
80 ribosomal proteins. During their assembly, a multitude of over 200 
trans-acting ribosome biogenesis factors (RBFs) ensures correct cleav- 
age, modification and folding of rRNA and concomitant incorporation 
of ribosomal proteins! ?. 

In eukaryotes, ribosome biogenesis starts in the nucleolus with 
the transcription of three of the four rRNAs as a primary transcript*. 
After dissociation of the large subunit precursor, pre-40S particles are 
exported to the cytoplasm, where final maturation processes occur’. 
Many proteins are involved in the later stages of human small subunit 
maturation, several of which we observe in this work, including the 
methyltransferase BUD23 (also known as WBSCR22) together with 
TRMT112, the armadillo (ARM)-like protein RRP12, structural pro- 
teins ENP1 (also known as BYSL) and LTV1, the endonuclease NOB1 
with its binding partner PNO1, GTPase-like protein TSR1, as well as 
the atypical kinases RIOK1 and RIOK2°*. 

High-resolution structures of pre-40S particles from fungi such as 
Saccharomyces cerevisiae have recently been published*'°. Despite 
differences in rRNA processing"! and the composition and function 
of RBFs!*°, however, human ribosomal precursors have been only 
described at low resolution so far", limiting the information to overall 
positioning of RBFs. 

Here we report cryo-electron microscopy (cryo-EM) reconstructions 
of several small ribosomal subunit precursors that provide detailed 
insights into late human 40S maturation principles. As a reoccurring 
concept, we found that assembly factors stabilize rRNA segments in 
distinct immature conformations before allowing their accommodation 
into their respective mature positions. Thereby, accurate step-wise and 
sequential maturation of rRNA from 5’ to 3’ is ensured, and premature 
engagement of 40S precursors in 80S formation and erroneous trans- 
lation is prevented. 


Cryo-EM analysis of the human pre-40S ribosome 
To understand better the maturation process of human 40S subunits, we 
aimed at structurally analysing native complexes, purified with PNO1 


as bait (states A-E), and reconstituted pre-40S particles in vitro (state 
R). Single particle cryo-EM analysis (Extended Data Fig. 1) resulted in 
a series of pre-40S structures that represent six different middle to late 
assembly states (Fig. la and Extended Data Fig. 2a, b). 

In state A, we found the RBFs RRP12, ENP1 and PNO1 together with 
the methyltransferases BUD23 and TRMT112. State B lacks BUD23 
and TRMT112, but comprises LTV1 and ribosomal proteins uS2, uS5 
and eS21, as well as the endonuclease NOB1. From state C on, RRP 12 is 
dissociated, and its place is occupied by RACK1. In addition, the kinase 
RIOK2, and ribosomal proteins eS12 and eS31 have bound, revealing 
remarkable conservation when compared with highly similar pre-40S 
subunits from yeast™’? (Extended Data Fig. 2d). In state D, ENP1 and 
LTV1 have been replaced by ribosomal proteins uS3, uS10, eS10 and 
uS14. Finally, an unassigned protein (factor X) is present in state E, 
partially occupying the binding site of the C-terminal helix of RIOK2. 

The atypical kinase RIOK1 has also been described to have a role in 
the final steps of pre-40S maturation, together with PNO1 and NOB1’. 
We did not observe such a state in our native pull-outs and, therefore, 
attempted to reconstitute it in vitro using purified mature 40S subunits 
and recombinant proteins (state R). We found that PNO1 and RIOK1 
have displaced ribosomal proteins eS26 and eL41, and that NOB1, PNO1 
and the pre-18S rRNA in state R adopt a highly similar conformation 
as in state E, including an immature helix 44 (h44) and a shifted 3’ end. 

Among all pre-40S reconstructions, two (states C and R) could be 
refined to average resolutions below 4 A. A full model of state C was 
built (Fig. 1b and Extended Data Fig. 2c), including biogenesis factors 
TSR1, NOB1, PNO1, ENP1 and RIOK2, as well as parts of LTV1, and 
a short segment of RIOK1 was built in state R (Extended Data Fig. 2e). 
The average resolution of the other states was between 4 and 4.5 A, with 
considerably lower local resolution in flexible areas. It was, however, 
sufficient to allow for unambiguous rigid body fitting of our models 
and assignment of RRP12, BUD23 and TRMT112 to extra densities in 
states A and B (Fig. 1b). 


The maturation process of the 3’ major domain 
The maturation process of the central region around helices h35-h40 
is dominated by large shifts of rRNA, in which RRP12 has a crucial 
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Fig. 1 | Structural analysis of human pre-40S particles. a, Cryo-EM 
reconstructions of five native pre-40S particles, obtained via N-terminal 
tandem affinity purification (N-TAP), and the mature 40S with a 
precursor-like complex reconstituted in vitro (see Methods). Names, 


role (Fig. 2 and Extended Data Fig. 3a). In state A, it coordinates h35- 
h37 and the region around the three-way-junction of h38-h39-h40 
in their immature position, and with h41 and uS9 parts of the head 
(Fig. 2a, top). The flipping of helices h35-h37 downwards by about 90° 
then allows binding of the uS2-uS5-eS21 cluster together with NOB1 
(Fig. 2a, middle and Extended Data Fig. 3b). Once RRP12 is released, 
h39 rotates almost 140° around the h34 axis to interact with h41, while 
h40 and eS17 accommodate close to their mature position in state C 


Side view Side view 


MStateC \/ J 
Mature 40S 


Fig. 2 | RRP12 binding and sequential pre-18S rRNA rearrangements 
during head formation. a, Movement of h35-h37 and h39 during early 
head formation. Volumes filtered at 6 A. b, c, Detailed view on rRNA 
rearrangements. b, The transition from state B to C is described by a 140° 
movement of h39 and h40. c, Final maturation requires repositioning of 
h34, h39 and h40. 
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average resolution and changes in protein composition are stated below. 
b, Models of state A (left) and C (right) 40S precursors with ribosomal 
biogenesis factors and h44 highlighted (grey: ribosomal proteins, light 
brown: rRNA). bk, beak; hd, head; If, left foot; pt, platform; rf, right foot. 


(Fig. 2a, bottom, and Fig. 2b). With the movement of these central 
rRNA helices, the interface is then formed for binding of RACK1. In 
yeast, Rrp12 has been shown to facilitate export of pre-40S particles 
through direct interaction with nucleoporins and the small GTPase 
Gsp1p*>. In combination with our results, this suggests that RRP12 
could serve in a quality control step, in which correct head formation 
and subsequent export are coupled. 

Despite major rRNA rearrangements, helices h34, h39 and h40 are 
yet to settle into their mature position (Fig. 2c), which depends on mat- 
uration events within the beak (Extended Data Fig. 3c). In our earlier 
particles, ENP1 and LTV1 occupy the central position on the beak, 
preventing the accommodation of the uS3-uS10-eS10-uS14 cluster 
and the mature positioning of h34, similar to what has been observed 
in yeast”. After the release of ENP1 and LTV] from the beak in state D, 
ribosomal proteins uS3, uS10, eS10 and uS14 settle into their respective 
sites, and this coincides with a movement of h34 closer to its mature 
position (Extended Data Fig. 3d). 


The maturation process of the decoding centre 

We observed an immature decoding centre in all native complexes and, 
to our surprise, also in native 40S particles treated with PNO1, NOB1 
and RIOK1 at high molar excess (20x; Extended Data Fig. 4a). Similar 
to yeast®, h44 is shifted outward by approximately 25 A in states A-E 
and 11 A instate R (Fig. 3a). This displacement prevents correct folding 
of the connecting region between h44 and h28 or h45, and therefore 
formation of the A and P sites. The biogenesis factors TSR1, LTV1 and 
RIOK2 probe this immature conformation through distinct terminal 
helices that bind underneath h44, probably contributing to its stabili- 
zation (Fig. 3b). 

TSRI1, an inactive mimic of translational GTPases’, interacts with 
the pre-40S body near rRNA helices h3-h5, h15 and h44 via domains I 
and III, with its N-terminal helix passing beneath h44 up to h12 (Fig. 3b 
and Extended Data Fig. 4b), as well as with the head via its domain 
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State E 


Fig. 3 | Positioning of h44 and maturation of the decoding centre. 

a, Comparison of h44 in its immature and mature position. b, Location of 
terminal helices of TSR1, LTV1 and RIOK2 between displaced h44 and the 
body. c, Interactions between TSR1 and h34 with the N-terminal extension 
of eS31. d, e, Clashes of biogenesis factor RIOK2 with unassigned factor X (d), 
RIOK1 with the C terminus of LTV1, and RIOK1 with eL41 (e). 


IV, as observed in yeast”. In addition, we observe distinct interactions 
between TSR1, pre-18S rRNA and eS31, which is recruited in state C. 
Together with domain IV of TSR1, the shifted N terminus of eS31 
(Extended Data Fig. 4c) coordinates a flipped out A1506 of the junction 
region of h32, h33 and h34, stabilizing h34 in an immature state after 
the rearrangement of the h34-h40 region (Fig. 3c). Supported by an 
insertion in domain II, which binds to uS13 and uS19 in the 3’ major 
domain, domain IV and eS31 might therefore serve as a link between 
maturation events within the beak and the intersubunit side. 

Another connection could represent LTV 1, of which we observed 
two interacting segments (Fig. 1a). In addition to the N-terminal part 
that binds ENP1 similarly to published structures”, its C terminus 
forms a long helix that stretches for about 60 A across the intersubunit 
side and ends beneath h44 (Fig. 3b), a position largely occupied by 
Dim1 in late yeast 40S precursors!®, Unlike in yeast, however, human 
dimethylation by DIMT1 is nuclear and, therefore, DIMT1 is not 
involved in the final maturation steps in the cytoplasm’. 

The kinase RIOK2 is observed in states C and D, binding to the 
decoding centre between the head and the platform as previously 
described?. In addition, we found its C terminus forming an elongated 
helix, which passes beneath h44 and is deeply buried within the tunnel 
that connects the solvent and intersubunit side (Fig. 3b). Surprisingly, in 
state E, factor X occupies parts of this tunnel with two helices, and super- 
imposition of both states reveals overlapping binding sites (Fig. 3d). 

Apart from RIOK2, BUD23-TRMT112 and the kinase domain 
of RIOK1 also bind to the neck and platform of pre-40S particles 
(Extended Data Fig. 4a, d). Furthermore, the C terminus of RIOK1 
binds between h27, h44 and h45, a site previously occupied by LTV1 
and later by eL41 in mature 40S particles (Fig. 3e). This is consistent 
with data suggesting that the C terminus of Rio] is crucial for its bind- 
ing to the ribosome in yeast!”. Furthermore, the clash of RIOK1 with 
LTV1 and RIOK2 could explain why the sole presence of RIOK1 and 
not its ATPase activity is sufficient for their release’. 


Coordination of site 3 cleavage by PNO1 and NOB1 

PNO1 has a decisive role in selectively binding and stabilizing h28 and 
h44 in an immature conformation in all states (Fig. 4 and Extended Data 
Fig. 5a). Superimposition of state C with mature 40S reveals two major 
clashes: First, the last helix of the K homology (KH) 2 domain of PNO1 
would clash with h28 (Fig. 4a and Extended Data Fig. 5b, c). This retains 


ARTICLE 


@ State R 
lm Mature 40S 


@ State C 
@ Mature 40S 


1210 
‘ : aot 
*-UUG ACGSAAGGGCA-"" *--UUG ACGSAAGGGCA~ 
LL} LILELEL has LE TEPilbibel 
AUGU UUGCCGU =e, ACAUGU UUCCCGU....., 


g ee 
A—U-——UUCCGUAGGU, j 1690 
1700 ae PT 700-0 ae 
AAGGCGUCCA ‘959 6—c 1S 1840 
a. = ey SUPPPEHAgSs 
U \ x 
c—G W c—ak } i 
G—E 1830 G— Giaad @AAGEC GU CCA er 
re e Hy aes u 
mag A A n44G A A 
Cc A A Cc A. Ul 
/ A AY 
1710 UTA & ist 10 U-A 
€— Ge 1820 State C C—Gr 809 Mature 


Fig. 4 | PNO1 interaction determines state of the decoding centre. 

a, Conformation of helices h28, h44 and h45 in native state C (left) and the 
reconstituted particle (state R, right) compared to their mature position 
(grey). Displacements stabilized by the presence of PNO]1 are indicated 
(arrows). b, Immature base pairing as a result of shifted helices h28 and 
h44. Summary of base interactions in state C and the mature 40S, with a 
focus on h28, h44 and h45. Base pairings are indicated by black lines and 
the relevant base stacking interactions by dashed lines. Immature helix 
highlighted in green. For an extended view, see Extended Data Fig. 6. 
ITS1, internal transcribed spacer. 


h28 in a backward position and prevents stacking between G1207 and 
G1837 (Extended Data Fig. 5d). Second, PNO1 would also clash with 
the connecting region between h44 and h45, prohibiting its mature 
fold while breaking another stacking between A1835 and A1863. As 
a result, A1863 is flipped out and can be recognized and bound by 
PNO1 (Extended Data Fig. 5e). Concomitant with the displacement 
of h44, novel immature base pairing is formed between nucleotides 
1699-1701 and 1836-1838 (Fig. 4b and Extended Data Fig. 5f). Notably, 
as observed in state R, incubation of fully matured 40S subunits with 
recombinant PNO1 (together with NOB1 and RIOK1) was sufficient 
to reverse the final maturation step by dissociating eS26 and forcing the 
rRNA to unfold partially (Fig. 4a, right, and Extended Data Fig. 5b). 

Besides stabilizing the central rRNA region in a distinct immature 
state and consistent with previous results”°, we found PNO] directly 
interacting with NOB1, which is absent in early pre-ribosomal 
particles®. Structurally, NOB1 can be divided into four parts: the PilT 
N-terminus (PIN) endonuclease domain, the insertion domain, the 
zinc-finger domain and the C-terminal domain. The PIN and zinc- 
finger domains form a stable core of NOB1 (Extended Data Fig. 5g) 
and mediate its interaction with ribosomal proteins uS2 and eS17 
(Fig. 5a). The insertion domain binds to the KH1 domain of PNO1 
involving residues Phe128 and Trp208 (Fig. 5b), and the C-terminal 
part of NOB1 forms a large loop, which interacts with the hydrophobic 
groove between the two KH domains of PNO] (Fig. 5c). 

In contrast to state R, we can trace several bases into the den- 
sity for the internal transcribed spacer 1 (ITS1) in all native states 
(Extended Data Fig. 5h), which is a clear indication that site 3 cleav- 
age has yet to happen. When compared with mature 40S, the 3’ end 
appears shifted from its mature position and is bound by PNO1 and 
NOBI. The C-terminal domain of NOB1 contains several positively 
charged residues that stabilize the backbone of the 3’ end. In addi- 
tion, Glu120 of the insertion and Arg398 of the C-terminal extension 
hold the ultimate nucleotide before site 3, A1869, in a tight pocket. 
In parallel, PNO1 binds the 3’ end and the ITS1 region via residues 
within its two KH domains, with Arg182, Ser235 and Tyr238 forming 
hydrogen bonds with G1862 and A1863, contributing to a sequence 
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Fig. 5 | NOB1 and PNO1 coordinate site 3 in an inactive conformation. 
a, Domain architecture and overall structure of NOB1 shows binding of 
NOB1 by PNO1, uS2 and eS17. b, c, Molecular detail of the interface of 
the insertion domain (b) and the C-terminal domain (c) of NOB1 with 


specificity in binding (Fig. 5d and Extended Data Fig. 5h), essentially 
resembling the situation observed in yeast”. Importantly, the RNA 
around cleavage site 3 is captured at a large distance (approximately 
30 A) from the rigid catalytic centre of the nuclease NOB1 by both, 
PNO1 and NOB1 itself (Fig. 5e). Movement of the ITS1 towards the 
catalytic centre is prevented by the insertion and extension domain 
of NOB1. As a consequence, the PNO1-NOB1 complex is kept in a 
catalytically silent conformation along the observed maturation pro- 
cess. Clearly, a large conformational change, and probably release of 
PNO1, will be required to trigger the cleavage of site 3 by releasing 
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PNO1. d, Coordination of the pre-18S rRNA 3’ end by NOB1 and PNO1. 
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catalytic centre (dashed circles). 


the RNA substrate from its confinement. This would be in agreement 
with previous hypotheses suggesting that, in the presence of Riol, 
cleavage at site D in yeast (site 3 in humans) by Nob] is facilitated 
after Pnol dissociation”!. 


Conclusion 

Taken together, late maturation of human small ribosomal subunit 
occurs in a sequential manner (Fig. 6): The largely mature 40S body 
(state A) is further completed by the incorporation of several riboso- 
mal proteins, allowing for recruitment of NOB1 as observed in state 
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Fig. 6 | A model of stepwise maturation of the human 40S ribosomal 
subunit. Illustrations of distinct human 40S precursors during head 
formation and 3’ processing. Transitions in RBF composition not shown in 
this work are displayed with dotted arrows. Movement of h44 and cleavage 


NATURE|www.nature.com/nature 


of site 3 by NOB1 is shown in inserts with focus on PNO1 (green), h44 
(yellow and blue), NOB1 (orange) and uS26 and uS28 (grey). The 60S 
model is based on EMDB-5592. 
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B. At the same time, the head and beak are formed by going through a 
sequence of distinct immature rRNA conformations, which is guided 
by interactions with BUD23, TRMT112, RRP12, ENP1, LTV1, TSR1 
and RIOK2, and eventually results in export competence. Once in 
the cytoplasm, several factors have been suggested to have a role 
in eukaryotes in triggering the final maturation (adopting native 
conformation, factor release, site 3 cleavage), including RIOK1, elF5B 
(Fun12 in yeast) and mature 60S subunits”. Interestingly, eIF5B as 
well as mature 60S subunits would clash at several sites with our state 
E, which may contribute to driving these last structural rearrange- 
ments (Extended Data Fig. 7). Finally, formation of the decoding 
region is inevitably accompanied by dissociation or repositioning of 
PNOI1, which, in turn, may facilitate the cleavage of 18S-E rRNA at 
site 3 and thus concludes 40S formation. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
018-0193-0. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized, and investigators were not blinded to allocation during 
experiments and outcome assessment. 

Molecular cloning. An N-terminal Strep-Flag tag was added to the multiple 
cloning site of the commercial vector pcDNA5/FRT/TO (Invitrogen). The region 
coding for PNO1 was amplified from human cDNA (Amsbio) using KOD DNA 
polymerase (Merck) and inserted into the modified pcDNA5/FRT/TO_StFLAG 
vector with BamHI and XhoI. DNA coding for human RIOK1, PNO1 and NOB1 
was amplified from a HEK293T cell reverse transcription cDNA library and cloned 
into a modified pET 28a vector (SUMO tag). 

Generation of cell lines. The generation of cell lines that stably express Strep-Flag- 
tagged PNO1 was adapted from previous work”. In brief, HEK Flp-In 293 T-Rex 
cells (Invitrogen) were grown to 70% confluency and transfected with pcDNA5/ 
FRT/TO_StFLAG-PNO1 and pOG44 (Invitrogen) using Lipofectamine 3000 
(Thermo Scientific). Selection was performed following the manufacturer’s pro- 
tocols with 150 jg ml“! hygromycin B (Thermo Scientific). Cells were maintained 
in DMEM (Thermo Scientific), containing 10% FBS, 100 jg ml! hygromycin B, 
10 g ml"! blasticidin, and 1x penicillin/streptomycin and GlutaMAX 
(Thermo Scientific). Cell lines were not authenticated or tested for mycoplasma 
contamination. 

Native complex preparation. Native 40S precursors were purified from a stable 
HEK293 T-Rex Flp-In cell line as previously described®. Twenty-four hours before 
collection, PNO1 expression was induced with 1.6 jig ml! tetracycline. Cells were 
collected in 0.025% trypsin/EDTA (Thermo Scientific), pelleted for 7 min at 1,800g 
and 4°C and washed once with PBS. Cells were then lysed for 30 min on ice using 
lysis buffer (10 mM HEPES pH 7.6, 100 mM KCl, 2mM MgCh, 1 mM dithiothrei- 
tol (DTT), 0.5 mM Na, 0.1 mM NaV304, 0.5% NP-40 substitute, and 1 protease 
inhibitor (Sigma Aldrich)). The lysate was cleared for 15 min at 4,400g and 4°C 
before transfer to equilibrated StrepTactin XT affinity beads (IBA Lifesciences), 
followed by a 2 h incubation in an overhead rotator. Beads were collected in small 
columns and washed four times with buffer A (10 mM HEPES pH 7.6, 100 mM KCl, 
2mM MgCh, 1 mM DTT, 0.5 mM NaK, 0.1 mM Na2V3O, and 1 x protease inhibitor). 
Bound material was then eluted five times with 30 min incubation in buffer 
A containing 50 mM p-biotin (Roth). Combined eluates were transferred to 
equilibrated anti-Flag affinity beads (Sigma-Aldrich), incubated for 2 h, followed 
by five washing steps with buffer A and elution of purified complexes by five times 
incubation for 30 min with 0.2 mg ml“! 3x Flag peptide (Sigma Aldrich) in buffer A. 
Finally, the combined eluates were concentrated on 300 kDa molecular mass 
cut-off filters (Sartorius) and analysed for protein concentration on a NanoDrop 
photometer (Thermo Scientific). Nikkol was added to a final concentration of 
0.05% to improve ice quality after grid preparation. Samples were then analysed 
by polyacrylamide gel electrophoresis on 4-12% Bis-Tris gradient gels (NuPAGE, 
Thermo Scientific), stained with SimplyBlue (Thermo Scientific) and bands were 
cut out and identified separately in-house via mass spectrometry. 

Reconstitution of pre-40S complexes. For recombinant expression, NOB1, PNO1 
and RIOK1 plasmids were transformed into Escherichia coli B]21 strains and 
induced with 0.1 mM IPTG at 15°C. Cells were lysed in 50 mM Tris pH 8.0, 300 mM 
NaCl, the lysate subjected to Ni-NTA affinity chromatography and proteins eluted 
overnight by on-column cleavage with Ulp1 at 4°C. Samples were purified further 
using ion exchange and size exclusion chromatography with a final buffer condition 
of 20 mM Tris-HCl pH 8.0, 300 mM NaCl, and 5 mM DTT. 

Human 40S subunits were purified from a HEK293T cell line. Cells were collected 
and disrupted using lysis buffer (20 mM HEPES pH 7.4, 100 mM potassium acetate, 
7.5mM magnesium acetate2, 1 mM DTT, 10 jg cycloheximide, 0.5% NP-40 and 1x 
protease inhibitor) and 80S ribosomes pelleted using a sucrose cushion (50 mM 
Tris pH 8.0, 500 mM potassium acetate, 25 mM magnesium acetate, 1 mM DTT, 
1 M sucrose and 0.1% Nikkol). The resuspended ribosomes were stored in subunit 
separation buffer (50 mM HEPES pH 7.4, 500 mM KCl, 2 mM MgCl and 2 mM 
DTT). To obtain separated 40S subunits, resuspended ribosomes were treated with 
puromycin at a final concentration of 1 mM, first 15 min on ice and then 10 min 
at 37°C. After the treatment, the mixture was applied onto a 10-40% sucrose gradient 
and run overnight in a SW 40 rotor at 49,500g for 18 h. The 40S peak was collected 
and changed to storage buffer conditions (20 mM HEPES pH 7.4, 100 mM 
potassium acetate, 2.5 mM magnesium acetate and 2 mM DTT). 

For the reconstitution of pre-40S complexes, purified small subunits (approx- 
imately 0.1 1M) were mixed with 2 1M RIOK1, 2 tM PNO1 and 2 1M NOB1 in 
storage buffer on ice for 30 min. Complexes were stabilized by crosslinking by add- 
ing glutaraldehyde to a final concentration of 0.5% (v/v). After 10 min incubation 
at room temperature, grids were prepared as described. 

Electron microscopy and image processing. Pre-coated (2 nm) R3/3 holey 
carbon supported copper grids (Quantifoil) were glow discharged at 0.2 hPa for 20s. 
Then, 3.5 ,1l of sample was directly applied onto each grid, blotted for 2-3 s at 4°C 
and plunge frozen in liquid ethane using a Vitrobot Mark IV (FEI Company). 


Grids were screened for ice quality and cryo-EM data acquired on a Titan Krios 
transmission electron microscope (FEI Company) at 300 kV under low-dose con- 
ditions (10 frames at about 2.5 e~ A~) with a nominal pixel size of 1.084 A per pixel 
on the object scale using the semi-automated software EM-TOOLS (TVIPS). A total 
of 17,345 and 11,977 micrographs of the native PNO1-pullout were collected on a 
Falcon II direct electron detector (datasets 1 and 2) at nominal defocus ranges from 
—1.0 to —2.5 jum. In total, 2,663 and 8,115 micrographs were collected of the two 
datasets ‘mature 40S’ (dataset 3) and ‘reconstituted pre-40S’ (dataset 4). Original 
image stacks were aligned, summed and drift-corrected using MotionCor2™. 
Contrast-transfer-function parameters and resolution were estimated for each 
micrograph using CTFFIND4” and Gctf*®, respectively. Micrographs with an 
estimated resolution below 4 A and astigmatism below 5% were manually screened 
for contamination or carbon rupture. Then, 11,356, 10,870, 2,523 and 7,753 micro- 
graphs were submitted to automated particle picking using Gautomatch (https:// 
www.mrc-lmb.cam.ac.uk/kzhang/Gautomatch/) and human 40S (EMDB-5592) as 
reference, resulting in 1,154,906, 1,637,274, 407,657 and 959,348 picked particles. 
These were then reference-free 2D-classified and classes individually 3D refined 
as shown in Extended Data Fig. 1 using Relion (version 2.1-beta-1)””"*. Datasets 
1 and 2 were individually processed before all particles of corresponding states were 
combined and further analysed (see Extended Data Fig. 1d). 

Model building and refinement. In general, for the molecular model building of 
ribosome biogenesis factors, we initially performed secondary structure prediction 
(PSIPRED”’) and, whenever crystal structures of homologues were available, we 
used 3D structure prediction by using SWISS-MODEL™. A full model was built 
for state C, while working models were prepared for all other states. 

In class C, the Saccharomyces cerevisiae crystal structure (PDB code 5IW7) was 
used as a reference for TSR1, followed by manual refinement and de novo building 
of the missing extensions (47-75, 392-405 and 464-485) into the density in Coot*!. 
For RIOK2, the Chaetomium thermophilum crystal structure (PDB code 4GYG) 
was used to generate a homology model, which was then rigid body fitted into 
its density and manually adjusted, before building de novo its C-terminal helices 
(494-544). The PNO1 structure is based on the Pyrococcus horikoshii crystal 
structure (PDB code 3AEV) with some minor adjustments to account for structural 
differences. For NOBI, the P. horikoshii NMR solution structure (PDB: code 2LCQ) 
was used as a reference to build a starting model for the PIN domain, followed by 
a manual refinement. The zinc-finger, insertion and C-terminal extension were 
all built de novo (94-233 and 256-412). The S. cerevisiae crystal structure (PDB 
code 5WWO) was used as a starting model for ENP1 and parts of LTV1. ENP1 
and the N terminus of LTV1 were then manually adjusted and built into the map 
in Coot, respectively. The C-terminal helix of LTV1 (414-417) was built de novo. 
The 18S rRNA and associated ribosomal proteins were modelled using the human 
40S ribosome (PDB code 5A2Q) asa reference, followed by rigid body fitting and 
manual adjustment. Immature h34 and h44, the linker region between h44 and 
h45, as well as the 3’ end and ITS1 were built into their respective densities. The 
final model was real space refined at a resolution of 3.6 A with secondary structure 
restraints for proteins and RNA, generated by ProoMART™ and LIBG*’, using 
PHENIX™ and REFMAC5*. Final model evaluation was done with MolProbity*°. 
Overfitting statistics were calculated by a random displacement of atoms in the 
model followed by a refinement with REFMACS against one of the half maps. 
Finally, Fourier shell correlation curves are calculated between the volume of the 
refined model and both half maps using Relion. 

For state A, our model of state C was fitted into the density, before removing 
or adjusting parts to account for conformational and compositional differences. 
Namely, NOB1, LTV1, RACK1, RIOK2, as well as uS2, uS5, eS12, eS21, eS31 and 
parts of eS17 were removed from the model. A secondary structure model of 
RRP12 obtained in state B was placed in the respective density. Models for imma- 
ture rRNA helices h34 and h39-40 were used from a model of state B and manually 
adjusted to their slightly different conformation. Helices h35-h37 from state C were 
rigid body fitted to additional densities enclosed by RRP12. A homology model of 
BUD23 and TRMT112 was generated based on PDB code 4QTU and rigid body 
placed into its respective density. 

For state B, the model for state C was rigid body docked into the density. Well- 
resolved areas within the body were then Phenix real space refined, while less 
resolved areas within the head were solely manually checked. To account for struc- 
tural differences, RACK1 and RIOK2, as well as eS12 and eS31 were removed from 
the structure. Flipped out 18S rRNA regions h34, h39 and h40 from state C were 
rigid body fitted together with the N-terminal part of eS17 into the focus-refined 
map in Chimera. RRP12 was modelled on a secondary structure level by manually 
placing poly-alanine alpha-helices in rod shaped densities. 

For state D and E, rRNA and ribosomal proteins from the mature state, as well 
as NOB1, PNO1, RIOK2, LTV1 and TSR1 from state C were rigid body fitted into 
their respective density using Chimera. 

A model for the mature 40S (PDB code 5A2Q) was fitted into and Phenix real 
space refined against the final mature 40S volume. 
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Finally, 18S rRNA and ribosomal proteins of the mature state were fitted into the 
reconstituted 40S precursor volume (state R). NOB1 and PNO1 were used from our 
model of state C and for RIOK1, only the C terminus which shows high resolution 
was de novo built, while the kinase domain was left out. 

Maps and models were visualized and figures created with the PYMOL 
Molecular Graphics System (Version 1.7.4, Schrédinger, LLC), ChimeraX*’ and 
UCSF Chimera. 

Statistics and reproducibility. Purification and sample preparation of native 
pre-40S complexes was done twice (n = 2) with equal results. Cryo-EM data from 
two different grids has been collected with similar results (see Extended Data 
Fig. 1d). Reconstitution of 40S precursors and respective cryo-EM data collection 
has been done once (n = 1). No statistical analysis has been applied throughout 
the work. 

Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 

Data availability statement. All cryo-EM density maps have been deposited 
in the Electron Microscopy Data Bank (EMDB) under the accession codes 
EMD-4349, EMD-4348, EMD-4337, EMD-4350, EMD-4351, EMD-4353 and 
EMD-4352. The atomic model of state C and all working models have been depos- 
ited in the Protein Data Bank (PDB) under accessions 6G4W, 6G4S, 6G18, 6G51, 
6G53, 6G5I and 6G5H. 
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Extended Data Fig. 1 | Sample preparation and cryo-EM analysis of the 
pre-40S ribosome. a, Coomassie stained SDS-PAGE analysis of samples 
used in this work with bands labelled as identified by mass spectrometry. 
RBFs that we do not observe in our structures are marked with an asterisk. 
60S bands are shown for comparison (see Supplementary Fig. 1 for source 
data). b, Representative micrographs from three datasets, low-pass filtered 
at 15 A. c, Representative 2D classes of three datasets showing various 
orientations of pre-40S particles. d, Cryo-EM data processing scheme 

for all datasets with final volumes highlighted in red. Classes that could 
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not be further refined or sorted are labelled flexible or noisy. For PNO1 
N-TAP, two datasets were collected and individually processed. Particles 
of respective states were combined and further analysed (see Methods). 
Complex purification was done three times independently with the same 
results. Cryo-EM data collection and analysis was done twice for native 
complexes with similar results. Mass spectrometry analysis, pre-40S 
reconstitution and data collection for datasets 3 and 4 were done once. 
DS, dataset. 
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Extended Data Fig. 2 | Local resolution, refinement and model particle from yeast (PDB code 6EML). Assembly factors coloured as in 
statistics. a, Local resolution distribution as estimated by Relion ranging Fig. la. e, Cartoon representations of models of RBFs (top) are shown 
from approximately 3 A in well-resolved areas to 12 A in more flexible together with their respective density (mid). Volumes are coloured 
parts. Colouring according to scale bars. b, Fourier shell correlation according to local resolution, which ranges from 3 A in more rigid areas 
(FSC) plot with average resolutions according to the ‘gold standard’ to 9 A in flexible parts. RRP12, BUD23 and TRMT112 are less resolved, 
(FSC = 0.143) stated in the legend. c, FSC plot of the state C model against —_ which only permitted placing of dummy helices and rigid body fitting 
cryo-EM maps as calculated by REFMACS (see Supplementary Data 1 of homology models, respectively. Examples of well resolved areas are 
for source data). d, Structural comparison between state C and a pre-40S depicted below. 
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formation. a, Positioning of RRP12 in state B shows interaction with eS17 view. c, Close-up view of the beak region before (left) and after (right) 
and uS9, as well as rRNA h34, h39, h40 and h41. Two additional helices replacement of ENP1-LTV1 with the uS3-eS10-uS20 cluster. c, Movement 


of unknown identity bridge RRP12 and the head. Poly-alanine helices of h34 during maturation. 
are placed in respective densities. b, Recruitment of NOB1 and the 
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(bottom) representation showing the position of the biogenesis factors finger domain. d, Overall structure of the reconstituted particle low-pass 
close to h44. b, Domain arrangement of TSR1. Insertions and extensions filtered at 12 A, showing the approximate positioning of RIOK1 (blue). 
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Extended Data Fig. 5 | PNO1, NOB1 and coordination of the 3’ rRNA 
end. a, The 3’ end region (box) throughout the maturation process. PNO1 
and NOBI bind similarly in state B-E and R. b, Cartoon representation 
of eS26 and PNO1 with parts of the pre-18S rRNA at different stages in 
maturation. h44 is shifted in state C. Displacement of eS26 by PNO1 in 
state R leads to a partially shifted h44. c, Overall structure of PNO1 with 
its two KH domains. d, e, Detailed view on the effects of PNO1 binding. 
Clashing of PNO1 with h28 and the linker region between h44 and h45 
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and A1835 and A1863 (e). f, Residues involved in novel base pairing with 
their respective electron density in state C. g, Comparison of the PIN 
domain of human (hs) NOB1 in state C and the NMR structure of Nob1 
from Pyrococcus horikoshii (ph; PDB code 2LCQ), with their conserved 
active site residues highlighted. h, Summary of residues involved in 
binding of the rRNA 3’ end (asterisk indicates stacking). Electron density 
of state C surrounding the 3’ end is shown. 
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Extended Data Fig. 6 | 2D diagram of the pre-18S rRNA head region of state C. Extended secondary structure diagram of the pre-18S rRNA head 
region of state C. Residues mentioned throughout the text are highlighted. Related to Fig. 4b. 
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Extended Data Fig. 7 | 60S subunits components clash with immature 28S rRNA clash with h44 in its immature position. Right, finally helix H66 
h44. Overview over clash sites of large subunit components with pre-40S of 28S rRNA clashes with a RIOK1 helix. A model of an 80S ribosome with 
particles. Left, initiation factor eIF5B, which is potentially involved in eIF5B (PDB code 4UJD) was aligned to pre-40S states, and the factors are 
formation of the 80S-like ribosome complex, occupies a similar binding shown as overlays together with one of the 40S precursors. 


site as TSR1. Middle, ribosomal protein uL23 and helices H69 and H71 of 
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Extended Data Table 1 | Cryo-EM data collection, refinement and validation statistics 


ARTICLE 


PNO1-TAP State R Mature 40S 
(EMD-4437) (EMD-4353) (EMD-4352) 
(PDB-6G18) (PDB-6GS5I) (PDB-6G5H) 

DATA COLLECTION 

Magnification 129,151 129,151 129,151 

Voltage (kV) 300 300 300 

Electron exposure (e- A’) 25 25 25 

Defocus range (um) -0.4 to -3.2 -0.7 to -3.3 -0.7 to -3.8 

Pixel size (A) 1.084 1.084 1.084 

Symmetry imposed Cl Cl Cl 

Initial particle images (no.) 1,737,823 959,348 407,657 

Final particle images (no.) 287,847 (state C) 83,883 70,822 

Map resolution (A) 3.6 (state C) 3.5 3.6 

FSC threshold 0.143 0.143 0.143 

Map resolution range (A) 3.3-7.5 3.3-6.8 3.4-8.6 

MODEL REFINEMENT State C 

Refinement 

Initial model used 5A2Q 

Model resolution (A) 3.6 

FSC threshold 0.5 

Map sharpening B factor (A?) -210.9 

Model resolution range (A) 3.3-7.5 

Model composition 

Non-hydrogen atoms 83825 

Protein residues 6060 

RNA bases 1658 

Ligands 2 

B factors (A?) 141.1 

R.m.s. deviations 

Bond lengths (A) 0.0068 

Bond angles (°) 1.19 

Validation 

Molprobity score 2.46 

Clashscore, all atoms 6.28 

Poor rotamers (%) 7.77 

Ramachandran plot 

Favoured (%) 93.33 

Allowed (%) 5.85 

Outliers (%) 0.82 

Validation (RNA) 

Correct sugar puckers (%) 96.14 

Good backbone conformations (“%) 67.37 


Summary of relevant parameters used during cryo-EM data collection and processing. Refinement and validation statistics are provided for the molecular model of state C. 
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Stellar populations dominated by massive stars in 
dusty starburst galaxies across cosmic time 


Zhi-Yu Zhang!?, D. Romano’, R. J. Ivison?*, Padelis P Papadopoulos*° & F. Matteucci®”® 


All measurements of cosmic star formation must assume an 
initial distribution of stellar masses—the stellar initial mass 
function—in order to extrapolate from the star-formation rate 
measured for typically rare, massive stars (of more than eight 
solar masses) to the total star-formation rate across the full stellar 
mass spectrum!. The shape of the stellar initial mass function in 
various galaxy populations underpins our understanding of the 
formation and evolution of galaxies across cosmic time’. Classical 
determinations of the stellar initial mass function in local galaxies 
are traditionally made at ultraviolet, optical and near-infrared 
wavelengths, which cannot be probed in dust-obscured galaxies”®, 
especially distant starbursts, whose apparent star-formation rates 
are hundreds to thousands of times higher than in the Milky Way, 
selected at submillimetre (rest-frame far-infrared) wavelengths*>. 
The °C/'80 isotope abundance ratio in the cold molecular gas— 
which can be probed via the rotational transitions of the 3CO and 
C180 isotopologues—is a very sensitive index of the stellar initial 
mass function, with its determination immune to the pernicious 
effects of dust. Here we report observations of '*CO and C'8O 
emission for a sample of four dust-enshrouded starbursts at 
redshifts of approximately two to three, and find unambiguous 
evidence for a top-heavy stellar initial mass function in all of 
them. A low °CO/C'%0 ratio for all our targets—alongside a 
well tested, detailed chemical evolution model benchmarked 
on the Milky Way°—implies that there are considerably more 
massive stars in starburst events than in ordinary star-forming 
spiral galaxies. This can bring these extraordinary starbursts 
closer to the ‘main sequence’ of star-forming galaxies’, although 
such main-sequence galaxies may not be immune to changes in 
initial stellar mass function, depending on their star-formation 
densities. 

Oxygen, carbon and their stable isotopes are produced solely by 
nucleosynthesis in stars®. The minor isotopes, 13C and 180, are released 
mainly by low- and intermediate-mass stars (those with stellar mass 
less than eight solar masses, M« <8Mo) and massive stars (M« > 8Mo); 
respectively®, owing to their differing energy barriers in nuclear 
reactions and evolution of stars!°. These isotopes then mix with the 
interstellar medium (ISM) such that the '°C/!8O abundance ratio meas- 
ured in the ISM becomes a ‘fossil; imprinted by evolutionary history 
and the stellar initial mass function (IMF)°. The abundances of the 
BCO and C#’O isotopologues in the molecular ISM, whose measure- 
ments are immune to the pernicious effects of dust, are therefore a very 
sensitive index of the IMF in galaxies. 

Galaxies in the early Universe, having had much less cosmological 
time available for prior episodes of evolution, are expected to have sim- 
pler star-formation histories than local galaxies. Our sample comprises 
the strongest carbon monoxide (CO) emitters in the early Universe, 
selected from the literature (see Methods): four gravitational lensed 
submillimetre galaxies at redshift z~ 2-3, with look-back times exceed- 
ing ten billion years. 


Using the Atacama Large Millimeter Array (ALMA) telescope, 
we have robustly (>50, where a is the standard deviation) detected 
multiple transitions of *CO and C'%0 in most of our target galax- 
ies. The J=3 — 2 lines from galaxy SDP.17b and the J=5 — 4 lines 
from galaxy SPT 0103—45 are marginally detected at approximately 
4a levels. But the J=4 — 3 transitions of SDP.17b are detected at high 
signal-to-noise ratio, so we can be confident that emission features 
seen at the expected velocities of the weaker transitions are also real. 
We also detected CO J=4—3 and J=5—4 for SPT 0125—47 and 
SPT 0103—45, respectively. 

As shown in Fig. 1, there is a decreasing trend in the ratio of velocity- 
integrated line intensities, I(‘*CO)/I(C'%0), with increasing infrared 
luminosity, Lyp (or the apparent star-formation rate traced by mas- 
sive stars). For all the galaxies in our observed sample (see Methods), 
the line ratios of I(‘3CO)/I(C'80) are close to unity, similar to those 
found!!! in three nearby ultraluminous infrared galaxies (ULIRGs, 
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Fig. 1 | I(!3CO)/I(C180) as a function of Lyp, corrected for gravitational 
amplification when appropriate. The rest-frame for Lyp is 8-1,000 xm. 
Red symbols refer to submillimetre galaxies (SMGs) in our sample. 

We include I(13CO)/I(C'8O) measurements of the Milky Way, nearby 
star-forming spiral galaxies, disks, a few resolved nuclei'®, three local 
ULIRGs!!, Arp 220, Mrk 231, and IRAS 13120—5453, and a submillimetre 
galaxy'?, SMM J2135—0102 at z~ 2.3. The ratios of the Small and 

Large Magellanic Clouds (SMC and LMC) are averaged from multiple 
positions*®”’. 1(3CO)/I(C'8O) in dwarf galaxy IC 10 is reported as a 
lower limit®”. A decreasing trend of I(‘*CO)/I(C'80) as a function of Lig 
is clearly evident, indicating that '?CO/C'8O abundance ratios are varying 
systematically in galaxies with different rates of apparent star formation. 
Error bars represent 1o uncertainty. 
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Fig. 2 | Theoretical °C and '°O isotopic abundance ratios in the 
ISM for different evolutionary tracks, predicted using various IMFs. 
a, Star-formation history for a delayed starburst, starting two billion 
years after the Big Bang, with a total cessation of subsequent star 
formation. Coloured lines correspond to different IMFs. b, Theoretical 


with Lip > 10'Lo)—Arp 220, Mrk 231 and IRAS 13120—5453—as 
well as in the strongly lensed submillimetre galaxy’? SMM J2135—0102 
at redshift z= 2.3. Galactic disks of nearby spiral galaxies have 
I(3CO)/I(C180) ratios similar to the representative ratio®* of the 
Milky Way’s disk, about 7-10. In the central nuclear regions of these 
spiral galaxies, where the star-formation activity is more intense than 
in the disks, I('°CO)/I(C'80) ratios are lower’®, though they remain 
restricted to I(13CO)/I(C!8O) > 4. The Magellanic Clouds—our 
nearest dwarf galaxies—show the highest I(!*CO)/I(C!80) ratios of 
about 30-60. 

For representative Galactic abundance ratios of '*CO/C!8O of 
about 7-10, the I(°CO)/I(C!80) line intensity ratio can approach 
values near unity—which is what we measure for all the galaxies in 
our sample—only if even the rarest of our three isotopologue lines, 
C'80, were to acquire substantial optical depths on galactic scales 
(see Methods). On the other hand, to reach line ratios I(!7CO)/I(3CO) 
and I(!2CO)/I(C80) in excess of 30—as found in our sample—the 
optical depths of !3CO and C!8O have to be much less than one 
for either type of condition—local thermodynamic equilibrium (LTE) 
or non-LTE excitation (see Methods)—assuming the typical abundance 
ratios of '*CO/'3CO ~ 40-100 found in the Milky Way®*. 

The magnification factors of gravitational lensing in our objects are 
modest (ju 5, see Extended Data Table 1), with the notable excep- 
tion of the Cloverleaf quasar (ju 10). It is unlikely that differential 
lensing could skew the measured I(13CO)/I(C!80) line ratio away 
from the value intrinsic to the galaxy, even in cases of much stronger 
lensing’. For differential lensing to operate in this way, the global 
I(3CO) and I(C180) distributions over the galaxies must be very 
different, which is improbable given that these two isotopologue lines 
have almost identical excitation requirements and any differences in 
their distribution are expected to be confined within individual molec- 
ular clouds (see Methods). Finally, the isotopologue lines have been 
observed simultaneously, making the uncertainties from pointing and 
calibration negligible. Furthermore, it was recently shown that known 
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13C/8O abundance ratio in the ISM as a function of time, following the 
different IMFs (shown in colour). Red symbols refer to submillimetre 
galaxies (SMGs) in our sample. Blue symbols show the '?CO/C'"O ratios 
measured in SMM J2135—0102 and a few representative local galaxies. 
Table 1 lists the detailed definitions of the IMFs adopted here. 


photo-chemical effects, such as selective photodissociation and frac- 
tionation, cannot induce global isotopologue abundances to differ from 
the intrinsic, IMF-determined, isotopic abundances in star-forming 
galaxies®!?, 

We thus conclude that emission in both '°CO and C'80 is optically 
thin for the bulk of the molecular gas mass in these galaxies. The sys- 
tematically low I(‘°CO)/1(C!80) ratios found in all our high-redshift 
starbursts—as well as in local ULIRGs—reflect intrinsic isotopologue 
abundance ratios over galaxy-sized molecular hydrogen reservoirs, that 
is, I(°CO)/I(C8O) = 8CO/C!80 = 8/180. Figure 1 thus reflects a 
strong decrease of the °C/!8O abundance ratio in starburst galaxies, 
compared to local spiral galaxies, galactic disks and dwarf galaxies. 

The only plausible basis for such systematic variations of isotopo- 
logue abundance ratios over galaxy-sized molecular hydrogen res- 
ervoirs is a change of the stellar IME, which must cause the intrinsic 
abundances of isotopic elements to deviate substantially from those 
found in ordinary star-forming systems where the standard IMF pre- 
vails. The robustness of this conclusion is made possible by tremen- 
dous advances in chemical evolution modelling, which now takes into 
account up-to-date stellar isotopic yields in dependence on both stellar 
masses and metallicities, time differentials for their release into the 
ISM, and the dependence of stars’ initial chemical composition on prior 
galactic evolution. Benchmarked against the rich isotopic datasets of 
the Milky Way, these models can now follow the chemical evolution of 
various isotopes and their abundance ratios, uniquely identifying the 
effects of different IMFs upon them® (see Methods). 

In Fig. 2, we present chemical evolution models that show how the 
isotopologue abundance ratios are altered by IMF types, and how they 
evolve as a function of cosmic time. The models show a massive galaxy 
that started an intense burst of star formation at z~ 3, reached a stellar 
mass of 10''Mo one billion years later, ceased forming stars after the 
burst, then evolved passively to the present day. This represents an 
extreme case for the evolution of the '°C/'8O abundance ratio for a 
pure starburst. The !3C/!8O abundance ratio starts from a high value 
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IMF a0 a ae mo (Mo) mi (Mo) mz (Mc) Ms (Mo) MS 100MS jy total cg 
Bottom-heavy —1.7 S17 =17 0.1 0.5 1.0 100 3.9 

Kroupa?© —0.3 —1.2 =1,7 0.1 0.5 1.0 100 6.9 

Top-heavy —0.3 =1,1 =i 0.1 0.5 1.0 100 33.3 

Ballero!® —0.3 —0.95 —0.95 0.1 0.5 1.0 100 44.0 


The slopes quoted in the table are for IMFs in mass, where mo (=0.1M,) and m3 (=100M,) are, respectively, the lower and upper limits of stellar masses assumed in the models, that is, the IMF is 
normalized to unity in the (0.1-100)M, range. m; and mz indicate the masses at which there is a change in the IMF slope, if any. For instance, for the Kroupa IMF, the slope changes at both m, and 
mo; for the Ballero and top-heavy IMFs, the slope changes only at m1; finally, the bottom-heavy IMF has a single slope. The Kroupa IMF slopes are adopted for reproducing typical Milky Way values in 


chemical evolution models®, which are within the error bars of the original reported values?. 


set by the first generations of metal-poor massive stars, because ‘°C 
is released from its primary and secondary nucleosynthesis channels 
(see Methods) earlier than '8O, which is purely a secondary element 
(see Methods). The ratio drops quickly during the starburst, then 
slowly increases with time, varying by a factor of 2-3 depending on 
the adopted IMF and the time interval. The late increase of the ratio 
is due mostly to the slow but continuous release of °C from low- and 
intermediate-mass stars (see Methods), which keep releasing °C for a 
long time after star formation—and, at the same time, the 189 pollution 
from massive stars—has ceased; wiggles in the '°C/'8O abundance ratio 
correspond to the lifetimes of roughly solar-mass stars. 

It is not possible to reproduce the observed I(#7CO)/I(C180) ratio 
in submillimetre galaxies with a Kroupa IMF (or similar IMFs; see 
Table 1), that is, with an IMF that can reproduce the ratios found in the 
Milky Way and in the disks of local spiral galaxies’. The top-heavy IMF 
and the Ballero!® IMF (which can reproduce the chemical abundances 
of stars in the Galactic bulge) under-produce the I (3CO)/I(C0) ratios 
observed in z= 2-3 starburst galaxies and local ULIRGs, but they 
can reproduce the extremely low I(‘°CO)/I(C!80) ratio recently 
measured’? in the central 500-parsec region of the starburst ULIRG, 
TRAS 13120—5453, which has been until now the lowest value reported 
in the literature. Note that the average star-formation event may have 
a less top-heavy IMF over galactic scales, or a mix of both top-heavy 
and canonical IMFs that produces a galaxy-sized average '°C/'8O < 1, 
which also applies for the resolved studies of IRAS 13120—5453. 

A clear trend is shown in Fig. 2: the more top-heavy the IME, the 
lower the I(°CO)/I(C}80) ratio, which is also compatible with the ratios 
found in local ULIRGs and the exceptionally small 1(7CO)/I(C180) 
ratio found’’ in the centre of IRAS 13120—5453. This paints a con- 
sistent picture in which a top-heavy IMF operates within both local 
ULIRGs and the much more numerous, distant starburst galaxies, 
where starburst events can quickly enrich the 80 abundance, push- 
ing the 13/180 ratio to (or below) unity. A canonical IMF can never 
produce a I(!3CO)/I(C!80) ratio close to unity, no matter what type of 
star-formation history or at what time along a galaxy’s evolutionary 
track the measurement is made. 

Multiple evidence in the local Universe has shown that the stellar 
IMF in galaxies with very high star-formation rate densities seems to 
be biased towards massive stars, such as ultra-compact dwarf galaxies!”, 
ULIRGs'* and progenitors of early-type galaxies’’. A top-heavy stellar 
IMF was recently also found in compact stellar associations in the 
Large Magellanic Cloud”*”’, whose high-density star-formation 
events may closely replicate what happens over galactic scales in 
distant starbursts. Our results—for the most intensive star-forming 
systems in the distant Universe, where classical ultraviolet and 
optical methods cannot be applied—are in line with these findings. 
We also note that metal-poor dwarf galaxies are likely to have an IMF 
biased towards low-mass stars, which is predicted by the integrated 
galaxy-wide IMF theory and is consistent with the results found in 
dwarf galaxies”? and the outer regions of disk galaxies, using Ha and 
ultraviolet observations‘. 

An IMF biased towards massive stars implies that the star-formation 
rates determined for submillimetre galaxies must be considerably 
reduced, since they are based on extrapolations of observables related 
to massive stars!. Moving from the Kroupa IMF to the Ballero IME, 
the relative mass fraction of massive stars increases by a factor of 6-7 


(see Table 1), meaning that star-formation rates derived from most clas- 
sical tracers! (the Lyp, the radio continuum and so on) must decrease by 
a similar factor. As a result, dusty starburst galaxies probably lie much 
closer to the so-called ‘main sequence’ of star-forming galaxies’ than 
previously thought. Classical ideas about the evolutionary tracks of 
galaxies” and our understanding of cosmic star-formation history” 
are challenged. Fundamental parameters governing galaxy formation 
and evolution—star-formation rates, stellar masses, gas-depletion and 
dust-formation timescales, dust extinction laws, and more”’—must be 
re-addressed, exploiting recent advances in stellar physics. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
018-0196-x. 
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METHODS 


Sample. Our sample comprises the strongest CO emitters in the early Universe, 
taken from the literature*'~**: four strongly lensed submillimetre galaxies at 
zr 2-3, with look-back times exceeding ten billion years. Two of these galax- 
ies, SPT-S J010312—4538.8 (z= 3.09, also known as SPT 0103—45) and SPT-S 
J012506—4723.7 (z=2.51, also known as SPT 0125—47), were selected* using the 
South Pole Telescope at wavelengths \= 1.4mm and 2mm; another, H-ATLAS 
J090302.9—014127 (z=2.31, also known as SDP.17b), was discovered using the 
Herschel Space Observatory” at far-infrared wavelengths; the last, H1413+117 
(z= 2.56, the ‘Cloverleaf’ quasar), was discovered as a result of its rare quadruple- 
spot optical morphology, and was later found to be bright in CO and in the 
dust continuum*)*°. We list the basic characteristics of the sample in Extended 
Data Table 1. 

Observations and data reduction. We have performed simultaneous observa- 
tions of °CO and C!80 using ALMA in its relatively compact array configura- 
tions (C36-1 and C36-2), with two 2-GHz-wide spectral windows in bands 3 and 
4. We used the remaining two spectral windows to cover continuum emission. 
Between 10 min and 30 min were spent on target for each transition. For SDP.17b, 
we observed both the = 3 2 and /=4—3 transitions of 3CO and C!80, in order 
to have a redundant measurement for their line ratios as well as constraints on the 
relative excitation in these rare isotopic lines. We also observed *CO J=5— 4 for 
SPT 0103—45 and CO J=4-—3 for SPT 0125—47, with similar configurations. 
Calibrators, integration time and atmospheric conditions are listed in Extended 
Data Table 2. 

All the data were calibrated manually using CASA v4.7.1°°, using standard 
procedures. We subtracted the continuum using the CASA task, uvcontsub, by 
fitting a linear slope to the line-free channels. We cleaned the visibility data with a 
channel width of about 20-30km s~!, using a Briggs weighting with robust = 1.5 
to optimize sensitivity. We applied a primary beam correction to all the cleaned 
data. Our target galaxies are mostly unresolved, or only marginally resolved. We 
assume that linewidths of CO and C!8O are the same as those of !*CO lines, to 
minimize uncertainties in the line flux fitting. Extended Data Figs. 1-3 present 
the velocity-integrated flux (moment 0) maps of !°CO and C80, overlaid with 
contours of high-resolution submillimetre continuum. In Extended Data Fig. 4 we 
present the spectra. SPT 0103—45 has two velocity components that cover a very 
large velocity span. The overall line profile of !*CO is consistent with CO, but 
limited by the noise level. We adopt only the narrow (stronger) component, seen 
for the yellow shadow region in the "CO J=5 4 spectrum, to avoid confusion 
from the broad (weaker) component (see Extended Data Fig. 4). Our synthesized 
beam sizes are mostly larger than, or at least comparable to, the apparent sizes 
revealed by the high-resolution submillimetre continuum images, so any missing 
flux is expected to be negligible. We extract spectra of '*CO and C’*O using cir- 
cular apertures equal to 4”-6” in diameter, as shown in Extended Data Figs. 1-3. 

To measure velocity-integrated line fluxes and the associated errors, we per- 

formed three independent methods: we first fitted one-dimensional Gaussian 
profiles to the extracted spectra with a fixed linewidth from !*CO, and fixed the 
frequency interval between !°CO and C180, since we can assume confidently that 
these two lines are emitted from the same excitation component, such that their 
line centres do not shift relative to one another. Second, we made moment-0 maps 
and fitted two-dimensional Gaussian profiles, as an independent check of line flux. 
Third, to better estimate the noise level, we also calculated the theoretical noise 
using the ALMA sensitivity calculator (https://almascience.eso.org/proposing/ 
sensitivity-calculator), given the integration time, precipitable water vapour, line- 
width and array configuration. The measured line fluxes and properties are listed 
in Extended Data Table 3. To be conservative, in Figs. 1 and 2 we adopt the largest 
error among the results from the three methods in the analysis. 
Line ratios and optical depths. Conditions of LTE. We first analyse the observed 
line ratios of "CO to C180, to constrain the molecular line optical depths, assum- 
ing LTE (Tii"® = T}'**). The line brightness temperature ratios of CO to 8CO and 
CO to C80 can be expressed as: 
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where Ty is the brightness temperature of the molecular transition, T., is the excita- 
tion temperature, and Tp, is the radiation temperature of the background emis- 
sion field, which is dominated by the cosmic microwave background, following 
Toop © 2.73 X (1 +z) K, where zis the redshift. r'"* is the optical depth of the given 
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transition. J,(T) = (hv/kg)/{exp[hv/(kgT)] — 1} is the Planck radiation temperature 
at the rest frequency of the line emission, vine, kp is the Boltzmann constant, h is 
the Planck constant, and T is the temperature considered. For optically thick lines 
(for example, !*CO for most conditions), 1 — exp(—r'™*) ~ 1; for optically thin 
lines (for example, CO and C!80), 1 — exp(—r!"*) ~ rine, 

In Extended Data Fig. 5 we present the I(!3CO)/I(C'80) and I(??CO)/I(°CO) 
velocity-integrated line intensity ratios as a function of the optical depths of 4CO 
and C'80, under LTE conditions. We also calculated the corresponding H, column 
densities, assuming Galactic abundances*”"*, A representative Galactic abundance 
ratio of *?CO/CO = 70 is assumed, and the optical depth for !*CO needs to be 
<0.03, which is around 10x lower than the Galactic average values" for producing 
the observed high I(!2CO)/I(1?CO) ratios, >30. The corresponding optical depth 
of CO is about 2, much lower than the typical values for 2COJ=1-40 found in 
typical Galactic molecular clouds", but consistent with a moderate optical depth 
of CO found in local starburst galaxies”. 

Only when the optical depth of C!O is much greater than unity (corresponding 

to r’©° being much greater than 7, which makes the molecular hydrogen column 
density Nu, much greater than 107° cm~?), does the I(!3CO)/1(C!80) line ratio 
approach unity. In this case the line ratios of I(!*CO)/I(!3CO) and I(?CO)/1(C8O) 
would also move towards unity, in conflict with our observed ratios. Even for 
moderate values of T ©° + 0.2-0.5, the line ratio of I(/3CO)/I(C!8O) stays at about 
6-7, not strongly biased by 7°©°. 
Non-LTE conditions. In Extended Data Fig. 6, we present non-LTE models derived 
with a non-LTE radiative transfer code, RADEX*®, showing the optical depth, 
1(?CO)/1(C!8O) and I(!2CO)/I(CO) line ratios as a function of CO column 
density and molecular gas column density, N,, |. We calculate for different H, vol- 
ume densities, ny» of 10? cm~?, 104 cm~? and 10° cm~?, covering the most com- 
mon ny;, range—typical values from normal molecular clouds to dense cores. We 
assume the same abundance ratios as we assumed for the LTE conditions, that is, 
®CO/3CO =70 and 3CO/C'80 =7, which are values representative of the Milky 
Way disk. The velocity width (full-width at half-maximum, FWHM) is set to 
300km s~1, as the typical (indeed, at the lower end) linewidth found in ULIRGs 
and submillimetre galaxies*’. In Extended Data Fig. 6b and c we overlay the LTE 
results, for comparison. 

For all models, we set the kinetic temperature, Tin, to be 30 K, which is a typ- 
ical dust temperature for the submillimetre galaxy population”, and is also the 
lower limit of the kinetic temperature of the H; gas, as the minimum temperature 
powered by the cosmic-ray heating for such starburst conditions**. Higher Tyin 
would bring the CO energy population towards higher-/ transitions, making opti- 
cal depths even smaller. 

Extended Data Fig. 6 shows that, for ny, = 10° cm~, which is a typical value 
for normal Galactic molecular cloud conditions, only when N,,, is much greater 
than 1076 cm~?, the line ratio of I(3CO)/I(C8O) can approach unity (<1.5, con- 
sidering the uncertainties in line ratios). The required high column densities are 
a few orders of magnitude higher than the typical values measured in submillime- 
tre galaxies: about 107-1074 cm~?, obtained using X-rays“4, CO radiative transfer 
modelling*’, and dust. This is especially supported by the Cloverleaf quasar, 
whose X-ray emission has been clearly detected“®, given the Compton limit of 
about 1074 cm7?. 

On the other hand, the high-density results, that is, forn,,, = 10* cm~? and 10° 

cm~°, are very similar to those under LTE conditions. For all conditions, a mod- 
erate !3CO optical depth does not vary the I(!3CO)/I(C!80) ratio much from the 
abundance ratio. So, it is highly unlikely that the unity value of the I(!°CO)/I(C'80) 
ratio can be caused by high optical depths. 
Possible HNCO contamination of C!*O lines. HNCO 595 — 44 (J=5 — 4) has 
a rest frequency of 109.9058 GHz, close to the rest frequency of C80 J=1—0 
(109.7822 GHz), with a velocity offset of about 370km s~!. When the linewidths 
are broad, these two lines are sometimes blended, which leads to a possible con- 
tamination of the C!8O measurements!?4”48, In these observations it has been 
found that HNCO J=5-—4 could contribute up to about 30% of the total flux 
(C'8O + HNCO), for the most extreme cases in the local Universe, for example, 
Arp 220% and IRAS 13120-5453”. 

HNCO J=5—4 has a critical density, neit © 10° cm~ and is regarded as a 
dense-gas tracer that could be excited in slow-shock regions***”. The Einstein 
A coefficient increases as A x (J+1)°, so Merit increases quickly for high-J 
transitions, such as HNCO J= 15 — 14 (res, = 329.66 GHz) and J= 20-19 
(Vrest = 439.62 GHz), contaminating the CO J=3— 2 and J=4 —3 transitions 
involved in our study much less. 

To estimate better how much HNCO lines may contaminate the C!8O lines, in 
Extended Data Fig. 7 we show the theoretical line ratios between HNCO and C80 
and high-J I(HNCO)/I(C!80) ratios normalized with I(HNCO J=5— 4)/I(C80 
J=1-0), using RADEX“®. We assume the same abundances measured in Arp 
220%”, and use molecular data from the Leiden Atomic and Molecular Database! 
(LAMBDA). We assume Tiin = 30 K to be the representative temperature of the Hp gas. 
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Extended Data Fig. 7a shows that the I(HNCO)/II (C!80) ratio increases with 
ny: Moreover, the ratio decreases quickly with J transitions, meaning that the 
contamination from HNCO to C!8O is much less severe for the high-J transitions, 
compared to that of the C'8O J=1-+0 line. Extended Data Fig. 7b shows 
I(HNCO)/I(C80) line brightness ratios normalized by I(HNCO J=5— 4)/I(C8O 
J=1-+0). With a weak dependency on ny,» the ratios of I(HNCO 
J=10—9)/I(HNCO J=5 — 4) and I(HNCO J= 15 — 14)/I(HNCO J=5 — 4) are 
one order of magnitude lower than unity. Unfortunately the LAMBDA database 
does not have the data for the transition of HNCO J=20— 19, whose HNCO/ 
C180 ratio is expected to be even lower. 

Even if we take the highest HNCO J=5 — 4 contamination found in local galax- 
ies, that is, 30%, the corresponding contamination for the high-J C!8O transitions 
will be at most 3%, which can be regarded as negligible for the lines in our study. 
Chemical evolution model. We adopt a single-zone chemical evolution model for 
our analysis, originally developed to describe the evolution of the Milky Way™, 
then further extended to other galaxies**. The model computes the evolution of 
abundances of multiple elements, including !7C, !°O, °C and !8O in the ISM of 
galaxies. We use detailed numerical models to solve the classical set of equations 
of chemical evolution!°*?-°, with the following assumptions: 

(1) Gas inflow with primordial chemical composition provides raw material for 
star formation. The gas is accreted at an exponentially fading rate and the timescale 
of the process is a free parameter of the model. 

(2) Galactic outflows remove both the stellar ejecta and a fraction of the 
ambient ISM. 

(3) Star formation follows the canonical Kennicutt-Schmidt law’; the masses 
of the newly-formed stars follow the input IMF. 

(4) Finite stellar lifetimes for different stars need to be considered (that is, no 
instantaneous recycling approximation (non-IRA) is adopted)°*. 

(5) Stars release the elements they have synthesized during their lifetime, as 
well as those already present when they were born that are left unaltered by the 
nucleosynthesis processes, when they die. 

(6) Stellar ejecta are mixed with the ISM homogeneously. 

The yields adopted account for the dependence of several stellar processes on 
the initial metallicity of the stars, and have been calibrated with the best fit using 
the Milky Way data, which are relevant to a range of metallicity and evolution 
timescales®. The time-delay effect is considered in the chemical evolution, namely 
the differences between the lifetimes of massive stars and low-mass stars*’. We 
used an analytical formula for the stellar lifetimes that linearly interpolate stellar 
lifetime tables**. The time lag in producing and releasing primary (those synthe- 
sized directly from H and He; that is, *C and '°O) and secondary elements (those 
derived from metals already present in the star at birth; that is, &C and *O—but 
note that a fraction of '°C is also synthesized as a primary element) is also con- 
sidered®?. These two effects, corroborated by the star-formation history and the 
IMF adopted, determine the amount of different isotopes released to the ISM on 
different timescales. In particular, the bulk of #C is released later than °C, and the 
bulk of !8O is released later than !©O. Chemical evolution models can now follow 
the evolution of various isotopic ratios, tracing abundance ratios not only between 
the isotopes of each element’, but also between different elements. 

The most important aspect regarding this work is that such models can now 
compare the effects of a young starburst (with a regular stellar IME, for example, 
the Kroupa IMF) against those due to different stellar IMFs via carefully cho- 
sen isotope (and thus isotopologue) ratios. This critical advance was made by no 
longer assuming instantaneous element enrichment of the ISM by the stars, but 
incorporating the different timescales of their release into the ISM. It should be 
noted that these timescales, and the relative delays between the release of various 
isotopes into the ISM, are set by stellar physics, that is, they are not free parameters. 
With the Kroupa IMF, only an unphysical combination of star-formation history, 
that is, for a starburst timescale 7 < 10 Myr and a star-formation rate of more than 
20,000Mo yr~!, could approach the observed !3C/18O ratios of near unity. 
Origins of carbon and oxygen isotopes. The I(!?CO)/I(°CO) line ratios have 
been found to vary systematically in galaxies with different star-formation rates 
and Hubble types*”*'. Owing to the differences in the origins of '*C and '°C, 
it has been proposed that the I(}7CO)/I(°CO) line ratios can be used to derive 
their abundance ratio, which can further probe the stellar IMF, or different 
star-formation modes!*360, 

The °C element is primarily produced by helium burning (the classical triple-ca 
process), and multiple channels can produce ?C in nucleosynthesis®™. In the Milky 
Way, '*C is primarily produced by low- and intermediate-mass stars, revealed by 
data for stars in the solar vicinity: in fact, the [C/Fe] ratio as a function of [Fe/H] is 
almost constant, with [C/Fe] being solar, indicating that C and Fe are produced in 
the same proportions by the same stars”“*, If mass loss from massive stars is consid- 
ered, !7C released from massive stars still accounts for <50% of the total®°, On the 
other hand, }3C is released mainly from low- and intermediate-mass stars largely 
as a secondary element, because °C production needs a pre-existing seed, namely, 


the primary element, '7C%*”, '°C also has a primary component in nucleosyn- 
thesis but it can only occur in red asymptotic giant branch stars, where periodic 
dredge-up episodes convect 'C to the stellar surface and form °C. 

Since both *C and 13C are mostly produced by low- and intermediate-mass 
stars, the ?C/3C ratio cannot discriminate between IMFs unambiguously. In pre- 
vious work, we show that by switching from the Ballero IMF"* (a very top-heavy 
IME, which can reproduce the chemical abundances of stars in the Galactic bulge) 
to the Kroupa IME, the '7C/°C ratio varies® only by a factor of 2, indicating that 
carbon isotopologue ratios are not very sensitive to IMF. Furthermore, the 7C 
abundance is very difficult to obtain because the '*C-bearing major isotopologue 
lines are mostly optically thick. 

The origin of oxygen isotopes is rather different. As earlier work suggested®, the 
stellar yields of 16O, and !8O are sensitive to different stellar masses, owing to their 
temperature sensitivity in stellar nucleosynthesis®. Production of O is dominated 
by massive stars™4, as revealed by chemical evolution models in the Milky Way using 
detailed stellar yields®. Only a tiny fraction of !°O is contributed by asymptotic giant 
branch stars”. Massive stars also dominate the production” of !80, predominantly 
in the early stages of helium burning®. As a secondary element, the '8O yield relies 
strongly on the pre-existence of !°O, so the metallicity in oxygen also plays a major 
part in producing '8O. The production of !80 is biased to more massive stars com- 
pared to °C which is more biased to low- and intermediate-mass stars. So, the 
abundance ratio of °C and !8O does indeed reflect different IMFs (see Fig. 2). 

The abundance ratios of !*C/C and !°O/'8O can trace star-formation timing 
and IMF*, respectively. The ?C and !6O abundances are compromised by the opti- 
cal depths of molecular lines, which are difficult to estimate accurately. However, 
the combination of both carbon and oxygen isotopologues—the abundance ratio 
of '3C and '8O—can be obtained easily from the observed intensity ratio of two 
optically thin lines, *CO and C!80, in the same J transition. Moreover, these two 
lines can be obtained simultaneously using current facilities, owing to the close 
spacing of their rest frequencies. They have almost identical critical densities and 
upper energy levels, essentially free from excitation differences. Even for strongly 
lensed galaxies, it is safe to assume that any differential lensing effect between the 
two lines is negligible. 

Code availability. We opt not to make the code used for the chemical evolution 
modelling publicly available because it is an important asset of the researchers’ 
toolkits. The code for analysing the line ratios and optical depths of '*CO, 3CO, 
C180 are based on the publicly available non-LTE radiative transfer code, RADEX 
(https://personal.sron.nl/~vdtak/radex/index.shtml). 

Data availability. The dataset that supports the findings of this study is available in 
the ALMA archive (http://almascience.eso.org/aq/) under the observing projects 
2015.1.01309.S (http://almascience.eso.org/aq/?projectcode=2015.1.01309.S), 
2013.1.00164.S (http://almascience.eso.org/aq/?projectcode=2013.1.00164.S), 
2011.0.00958.S (http://almascience.eso.org/aq/?projectcode=2011.0.00958.S) 
and 2011.0.00747.S (http://almascience.eso.org/aq/?projectcode=2011.0.00747.S). 
Additional requests can be directed to the corresponding author. 
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Extended Data Fig. 1 | Velocity-integrated flux maps (moment 0) of 
3CO and C!80 for SDP 17b. Black contours show the high-resolution 
250-GHz continuum image, obtained from the ALMA archive”®, with 
levels of 30, 100 and 500 (o =0.6 x 107! mJy beam). Dashed red 
circles show the adopted apertures for extracting spectra. a, b, Images 
of 3CO and C!80O for the J = 3 — 2 transition. White contours show the 
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95-GHz continuum, with levels of 30, 5a and 100 (0 =1.7 x 10°-* mJy 
beam~'). c, d, Images of CO and C!80 for the J=4 — 3 transition. White 
contours show the 133-GHz continuum, with levels of 30, 50 and 100 

(o =2.3 x 10°? mJy beam™!). The corresponding synthesis beams (white 
for '3CO and C'8O, and black for the 250-GHz continuum) are plotted in 
the bottom left. 
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Extended Data Fig. 2 | Velocity-integrated flux maps (moment 0) of emission, with levels of 30, 100 and 300 (0 =0.14 Jy beam™!km s7!). 
13CO and C!8O J=5 — 4 for SPT 0103—45 and the J = 3 — 2 transition White contours show the 135-GHz continuum, with levels of 30, 100 and 
in SPT 0125—47. Black contours show the high-resolution 336-GHz 300 (0 =2 x 10°? mJy beam“). c, d, Images of '*CO and C80 for the 
continuum image, obtained from the ALMA archive”’, with levels of 30, J=3-—2 transition in SPT 0125—47. White contours show the 94-GHz 
10¢ and 300 (9 =2.3 x 10-? mJy beam '). Dashed red circles show the continuum, with levels of 3c, 5a and 100 (o =2.2 x 10°? mJy beam“'). 
adopted apertures for extracting spectra. a, b, Images of *CO and C8O The corresponding synthesis beams (white for '*CO and C!8O, and black 


J=5-—4 for SPT 0103—45. Blue contours show the narrow '*CO J=4-+3 _ for the 336-GHz continuum) are plotted in the bottom left. 
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Extended Data Fig. 3 | Velocity-integrated flux maps (moment 0) of 
3CO and C!80 for the J =3 — 2 transition in the Cloverleaf quasar. 

a, Image of the '°CO J=3 — 2 transition. b, Image of the C180 J=3— 2 
transition. Black contours show the high-resolution 690-GHz continuum 
image, obtained from the ALMA archive’, with levels of 30, 5 and 10 
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(oc =0.8 mJy beam’). Dashed red circles show the adopted apertures for 
extracting spectra. White contours show the 92-GHz continuum, with 
levels of 30, 50 and 100 (o =2 x 10~* mJy beam !). The corresponding 
synthesis beams (white for *CO and C'O, and black for the 690-GHz 
continuum) are plotted in the bottom left. 
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Extended Data Fig. 4 | ALMA spectra of the observed '*CO, CO and 
C80 transitions. a, ALMA spectra of 7CO in SPT 0125—47 and SPT 
0103—45. Yellow shading shows the velocity range adopted from *CO in 
the analysis. b, ALMA spectra of !?CO and C80 for all targets. All spectra 
are in black. Red lines show Gaussian fits to the observed lines. Velocities 
are labelled relative to their *CO or 3CO transitions. 
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Extended Data Fig. 5 | I(3CO)/I(C!8O) and I(!7CO)/I(3CO) line ratios 
as a function of optical depth of !*CO, under LTE conditions. 

a, I(#3CO)/I(C!80) line ratio as a function of optical depth of !3CO. 

b, 1(*CO)/I(3CO) line ratio as a function of optical depth of 8CO. Both 
ratios assume LTE conditions. We assume the abundance ratios of '*CO/ 
C80 and '*CO/CO are 7 and 70, respectively, which are representative 
values found in the Milky Way. This shows that the I(!*CO)/I(C'80) line 
ratio approaches unity (blue line) only when the optical depth of C180 is 
greater than or equal to 1 (and the corresponding optical depth r''©° = 7). 
The bottom scale bar shows the corresponding N,,,, assuming a CO/H 
abundance’® of 8.5 x 10~°. rand R are the intrinsic abundance ratio and 
measured line brigntness ratio, respectively. 
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Extended Data Fig. 6 | Optical depths, I(!7CO)/I(C'8O) and 
I(}?CO)/I(3CO) line ratios as a function of H, column density, under 
non-LTE conditions. a, Optical depths of *CO, CO and C!8O, for the 

J =3 — 2 transition; b, I(!3CO)/I(C!80) line ratio, and ¢, I(!2CO)/I(7CO) 
line ratio as a function of H, column density, Nj, and BCO column 
density in various physical conditions, for non-LTE models calculated with 
RADEX“. For all models, we set the abundance ratios of CO, !3CO and 
C!80 to be Galactic: *CO/!3CO =70 and 3CO/C80 =7, which are 
representative values of the Milky Way disk. Different line styles show the 


gas conditions of H) volume densities, n,,, = 10? cm~, 104 cm~? and 


10° cm~?. The Tin value for all models is set to 30 K, which is a typical dust 
temperature for the submillimetre galaxy population, and the lowest Tin 
that H) gas can reach for such intensive starburst conditions, due to cosmic 
ray heating**. In b and c, we also overlay the line ratios (in thick green 
lines) with the LTE assumption for comparison. All three panels show that 
for Galactic abundances the line ratio of '?CO/C!80 can approach unity 
only when the '?CO column density is higher than 10'°-10*° cm? (that is, 
H; column density Ny, > 1075-107 cm~?). 
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Extended Data Fig. 7 | I(HNCO)/I(C18O) line ratio and normalized 
I(HNCO)/I(C180) ratio as a function of H2 volume density. 

a, I(HNCO)/I(C'80) line ratio as a function of Hz volume density. 

b, I(HNCO)/I(C'80) line ratio as a function of H2 volume density, 
normalized with I(HNCO J =5 — 4)/I(C!8O J= 1-0). Both ratios are 
calculated using RADEX”’, in which we assume the same abundances as 
measured in Arp 2204”. We assume Tin = 30 K as the representative kinetic 
temperature of the Hp gas. 
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Extended Data Table 1 | Target properties 


Short name IAU name R.A. Dec. Redshift Lensing Dr/u M,/u 
J2000 J2000 Zz amplification, zu 108Lo 10!°M 
SPT0103-45 SPT-SJ010312-4538.8 01:03:11.50 —45:38:53.9 3.0917 5.3 40.11 1.2 5.5084 
SPT 0125-47 SPT-SJ012506-—4723.7 01:25:07.08 -47:23:56.0 2.5148 5.5+0.1* 2.2 - 
SDP.17b HATLAS J090302.9014127 09:03:03.02 —01:41:26.9 2.3051 3.561017 2.0 24.286 
Cloverleaf H1413+117 14:15:46.23 = + 11:29:44.0 2.5585 117 6.0 - 


Data for SDP.17b are from refs 718°, IAU, International Astronomical Union, R.A., right ascension, Dec., declination. 

*We adopt an amplification factor, j:, derived from lens modelling using 850-1m ALMA visibility data in the literature’?:79. 
tWe adopt the best-determined amplification factor, ., reported in the literature?>’7°, whose lensing model is derived from CO J=7 6 line emission, and could better reproduce a single source on the 
lens plane’, Unfortunately the uncertainty of this amplification factor was not reported, but the uncertainty in j. does not jeopardise our conclusions. 
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Extended Data Table 2 | ALMA observational information 


Target SPT0103-45 SPT0103-45 SPT0125-47 SPT0125-47 SDP.17B SDP.17B Cloverleaf 
lsotopologue BCo, CBO 2Co Bco, c#o 2Co BCo, CBO BCo, CBO BCo, c#O 
Transition J=5-4 J=5-4 J=3>2 J=453 J=3-2 J=433 J=3->2 
Observing Date 21-Jan-2016 21-Jan-2016 21-Jan-2016 21-Jan-2016 17-Jan-2016 16-Jan-2016 08-Apr-2016 
Bandpass Calibrator J2357-5311 J2357-5311 J2357-5311 J2357-5311 J0854+2006 J0854+2006 J1337-1257 
Flux Calibrator Neptune J2357-5311 Neptune Neptune J0854+2006 J0854+2006 Callisto 
Gain Calibrator J0056-—4451 J0051-—4226 J0124—-5113 J0124—-5113 J0909+0121 J0909+0121 J1415+1320 
Integration Time (s) 1300 120 605 120 816 726 1753 
Median PWV (mm) 6.1 5.6 6.2 6.0 2.0 3.2 3.0 
Median T,,, (K) 86 85 93 88 §2 60 70 
Angular Resolution 2.51.8” 2.7” x1.6” 3.6” x2.5” 2.5’"«1.8” 3.12.3” 2.7” 1.8” 3.7” x2.0” 


PWV, precipitable water vapour. 
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Extended Data Table 3 | Observed targets, lines, frequencies, linewidths and fluxes 


Target Transition Yobs line Jront ote 5 AViine  Frpeak 
J>J-1 GHz Jykms7! Jykms-! Jykms“! kms“! mJy 
SDP.17b RCOsJ=433 139.49 9120.3 - - 320 ~40 
SDP.17b BCOJ=32 100.02 0.32+0.05 0.34 + 0.08 0.08 0.9 + 0.3 
SDP.17b CBO yJ=3>2 99.64 0.26 + 0.05 0.32 + 0.08 0.08 0.8 + 0.3 
SDP.17b BCOJ=43 133.36 0.47+0.07 0.46 + 0.08 0.07 1.3+0.4 
SDP.17b CBO J =433 132.85 0.34+0.06 0.50+0.08 0.07 1.0+0.4 
Cloverleaf R2COsJ=3>2 97.17 13.2+0.2 - - 400 30 + 1.7 
Cloverleaf BCOJ=3—>2 92.90 0.65 + 0.09 0.61+0.06 0.07 14+0.4 
Cloverleaf c8O J =332 92.55 0.40 + 0.10 0.43 + 0.06 0.07 0.8 + 0.4 
SPT 0103-45 2oos=433 112.68 8.2+0.6 - - 32 + 0.6 
SPT 0103-45 2C¢0OJ=54 140.91 8.8 + 0.57 8.8 + 0.67 - 300* 27.8 + 0.2 
SPT0103-45 BCOJ=5>4 134.65 0.3740.07 0.38+0.05 0.07 1.2404 
SPT 0103-45 CBO y=5 54 134.13 0.35+0.09 0.39 + 0.07 0.07 1.2+0.4 
SPT 0125-47 260 J=32 98.38 18.14+0.5 18.0+0.5 - 4344 
SPT 0125-47 2Cos=4-33 131.21 26.9 + 0.7 26.8 + 0.7 - 400 69 +3 
SPT 0125-47 BCOJ=3>2 94.06 0.78 + 0.09 0.86 + 0.07 0.1 2.0+0.4 
SPT 0125-47 CBO y=332 93.70 0.63 + 0.07 0.71+0.1 0.1 1.6+0.4 


Data are from refs 32:74,75, T.,, is the system temperature. 


«Theoretical noise level calculated using the ALMA sensitivity calculator (https://almascience.eso.org/proposing/sensitivity-calculator). 
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There are two velocity components in the !2CO spectrum. We adopt only the narrow component, seen for !2CO J=5 — 4, to avoid the broad and weaker component (see Extended Data Fig. 4). 
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Deterministic quantum state transfer and remote 
entanglement using microwave photons 


P. Kurpiers!**, P. Magnard4, T. Walter!, B. Royer’, M. Pechal', J. Heinsoo!, Y. Salathé!, A. Akin!, S. Storz!, J.-C. Bessel, 


S. Gasparinetti!, A. Blais?? & A. Wallraff!* 


Sharing information coherently between nodes of a quantum 
network is fundamental to distributed quantum information 
processing. In this scheme, the computation is divided into 
subroutines and performed on several smaller quantum registers 
that are connected by classical and quantum channels!. A direct 
quantum channel, which connects nodes deterministically rather 
than probabilistically, achieves larger entanglement rates between 
nodes and is advantageous for distributed fault-tolerant quantum 
computation”. Here we implement deterministic state-transfer 
and entanglement protocols between two superconducting 
qubits fabricated on separate chips. Superconducting circuits? 
constitute a universal quantum node?‘ that is capable of sending, 
receiving, storing and processing quantum information**. Our 
implementation is based on an all-microwave cavity-assisted Raman 
process”, which entangles or transfers the qubit state of a transmon- 
type artificial atom’? with a time-symmetric itinerant single photon. 
We transfer qubit states by absorbing these itinerant photons 
at the receiving node, with a probability of 98.1 +0.1 per cent, 
achieving a transfer-process fidelity of 80.02 + 0.07 per cent fora 
protocol duration of only 180 nanoseconds. We also prepare remote 
entanglement on demand with a fidelity as high as 78.9 + 0.1 per 
cent at a rate of 50 kilohertz. Our results are in excellent agreement 
with numerical simulations based on a master-equation description 
of the system. This deterministic protocol has the potential to be 
used for quantum computing distributed across different nodes of 
a cryogenic network. 

Remote entanglement has been realized probabilistically using her- 
alded or unheralded protocols based on measurement projection!"4, 
single-'>!6 or two-photon'”~”° detection or direct transfer of a single 
photon’), See Methods and Extended Data Fig. 1 for an overview 
of selected experimental results, including a discussion of concur- 
rent deterministic experiments performed with superconducting cir- 
cuits?®*4, However, a fully deterministic implementation” of direct 
transfer protocols is more challenging to realize. In the protocol”, a 
stationary atom is coupled to a single-mode cavity in remote quantum 
nodes and a coherent drive entangles the state of the atom with the field 
of the cavity. The cavity is coupled to a directional quantum channel 
into which the field is emitted as a time-symmetric single photon. This 
photon travels to the receiving node where it is ideally absorbed with 
unit probability using the time-reversed coherent drive (Fig. 1a). In 
addition to establishing entanglement between the nodes, direct trans- 
fer of quantum information offers the possibility to transmit arbitrary 
qubit states from one node to the other. 

In our adaptation of this scheme (Fig. 1b) to the circuit quantum 
electrodynamic architecture, each quantum node (labelled A and B) is 
composed ofa superconducting transmon qubit with transition fre- 
quency Vie = 6.343 GHz or Vee = 6.093 GHz dispersively coupled to two 
coplanar microwave resonators, analogous to an atom coupled to two 
cavity modes. One resonator is dedicated to dispersive transmon read- 
out and the other to excitation transfer. The transfer resonators at the 


two nodes are tuned to have matching frequencies vr + 8.400 GHz and 
large bandwidths k:7/(27) of the order of 10 MHz (see Methods). All 
resonators are coupled to dedicated filters, to protect the transmons 
from Purcell decay”*””. An external coaxial line with a length of 0.9 m, 
bisected with a circulator, connects the transfer circuits of both chips. 
With this set-up, photons are routed from node A to B and from node 
B toa detection line. If perfect absorption of the photon can be realized 
and independent detection of the photon is not needed or desired, then 
the circulator can be omitted from the circuit”®. To generate a control- 
lable light-matter interaction, we apply a coherent microwave tone to 
the transmon, which induces an effective interaction ¢(t) with tunable 
amplitude and phase®”® between states |f,0) and |g,1). Here, |s,) 

denotes a Jaynes-Cummings dressed eigenstate with transmon state 
|s) and Fock state of the transfer resonator |n) . The two lowest-energy 
eigenstates (|g) and |e)) of the transmon form the qubit subspace; the 
second excited state (|f)) is used as an auxiliary level to control the 
light- matter interaction in our experiment. This interaction swaps an 
excitation from the transmon to the transfer resonator, which then 
couples to a mode propagating towards node B. By controlling ¢(t) 
(see Methods), we shape the itinerant photon to have a time-symmetric 
envelope $(t) = = [Rett sech (K,,;¢ /2), with an adjustable photon band- 
width ‘eg limited only by «7. By inducing the reverse process 
|g, 1) < |f, 0) with the time-reversed amplitude and phase profile of 
&(t), we absorb the itinerant photon in the transmon at node B. Ideally, 
this procedure returns all photonic modes to their vacuum state. We 
note that in our system this process could also be implemented with 
asymmetric photon shapes, or ones with a more structured time 
dependence”, as long as the physical constraints on the bandwidth 
required for its emission and absorption are met at the respective sites. 

To characterize the excitation transfer, we start by initializing the 
transmon in its ground state*°, after which we apply a sequence of two 
T pulses (R*, Rj;) to prepare the transmon at the receiving node B in 
state |f,0). Next, we induce the effective coupling g(t) with a 
modulated drive R7,,, to emit a symmetric photon? (Fig. 2a). We vary 
the instantaneous fre requency of Rj,,, to compensate for the drive- 
amplitude-dependent a.c. Stark shift of the |f,0) < |g, 1) 
transition (see Methods). Here, and in all subsequent measurements, 
the population of the transmon states are extracted using single-shot 
readout with a correction to account for measurement errors 
(see Methods). The populations of the three lowest levels of the 
transmon P,., are measured immediately after truncating the 
emission pulse R7 fog: at time 7 (Fig. 2b). In this way, we observe that the 
transmon evolves smoothly from |f) to |g) during the emission pro- 
cess. At the end of the protocol, the emitting transmon reaches a 
ground-state population of Ps =95.8%, which characterizes the emis- 
sion efficiency. 

To verify that the envelope of the emitted photon has the target shape 
and bandwidth «,f,/ (27) =10.6 MHz, we repeat the emission protocol 
with an initial transmon state (|g) + |f)) /-/2 and measure the aver- 
aged electric-field amplitude (a,,,,(t)) « d(t) of the emitted photon 


out 
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Fig. 1 | Schematic and measurement set-up. a, Quantum optical 
schematic of a deterministic unidirectional entanglement protocol 
between two cavity quantum electrodynamic nodes of a quantum network. 
At the first node, a three-level system is prepared in its second excited state 
[f) (grey half-circle) and driven coherently (¢(f), blue arrow) to |g) (blue 
half-circle), creating the transfer cavity field |1) (light yellow). The cavity 
field couples into the directional quantum channel with rate « as a single- 
photon wavepacket with an effective bandwidth keg (yellow hyperbolic 
secant shape). In the second quantum node, the time-reversed drive ¢(—t) 
transfers the excitation from |g) to |f) in the presence of the transferred 
photon field |1) . Finally, the protocol is completed with a transfer pulse 
between |f) and |e) (red half-circle) to return to the qubit subspace. In 
addition, each three-level system is coupled to a readout cavity (grey). 

b, Implementation of the system depicted in a in a planar, chip-based, 
circuit quantum electrodynamic architecture (Extended Data Fig. 2). At 
each node, a transmon (orange) is coupled capacitively to two \/4 coplanar 
waveguide resonators and Purcell filter circuits”’ that act as the transfer 
(yellow) and readout (grey) cavities, respectively. The output transmission 
lines are coupled galvanically to the corresponding circuit. A directional 


state (|0) + |1)) /-/2 using heterodyne detection*! (Fig. 2c). We pre- 
pare this photon state because of its non-zero average electric field”. 

Repeating the emission protocol from node A leads to similar 
dynamics of the transmon population (Fig. 2e). We adjust the amplitude 
and phase of the transfer pulse (Fig. 2d) so that the photons emitted from 
each node A and B have similar effective bandwidth «in spite of their 
respective transfer resonator bandwidths « differing by approximately 
30% (see Methods). The detected integrated power A | (aour(t)) |? dt 
of the photon emitted from node A (Fig. 2f) is lag = 23.0% + 0.5% lower 
than that emitted from node B owing to loss accumulated as the photon 
travels from node A to B. The photon loss [43 is extracted from the ratio 
of the integrated photon powers for emission from nodes B and A 
(see Methods). In addition, the envelope of the photon emitted from 
node A is slightly distorted by the reflection off node B, as determined 
by the response function of its transfer resonator, which is fully 
captured by our theoretical model. 

To characterize the absorption of the single time-symmetric photon 
emitted from node A at the receiving node by time-reversing the emis- 
sion pulse of node B (Fig. 2a, g), we measure the population of trans- 
mon B during the process. We apply a 7 pulse to transmon B to map 
If) back to the qubit subspace before performing the readout. We 
observe the population of |e) to rise smoothly and saturate at 
PS" = 67.5% (Fig. 2h). This saturation level reflects the efficiency of the 
protocol for the transfer ofa single excitation (a single photon), which 
is executed in a pulse sequence of only 180-ns duration (Fig. 2g). From 
the ratio of the integrated power of the emitted photon in the absence 
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quantum channel is realized using a semi-rigid coaxial cable and a 
circulator connecting to the output port of the transfer circuit Purcell filter 
at each node. ¢, d, Details of the circuit quantum electrodynamic 
implementation. c, Combined qutrit (vg) and [f, 0) < |g, 1) transition 
(Vfog1) microwave drive using single-side-band modulation with in-phase 
(1) and quadrature (Q) mixers driven by a local oscillator (LO) and with an 
envelope defined by an arbitrary-waveform generator (AWG) for node A. 
At node B, these drives are synthesized directly by a fast AWG with 

25 GS s~!. d, Schematic of microwave detection lines (black, red triangles). 
All detection lines consist of two isolators, a bandpass filter, a cryogenic 
amplifier (HEMT) and two room-temperature amplifiers followed by a 
filter and analogue down-conversion to an intermediate frequency of 

250 MHz. The down-converted signal is lowpass-filtered, digitized using 
an analogue-to-digital converter and recorded using a field-programmable 
gate array (FPGA). The transmon-readout lines include an additional 
Josephson parametric amplifier (JPA) circuit (red elements) between the 
first two isolators. The JPA is pumped by a signal generator and the 
reflected pump signal from the JPA is cancelled at a directional coupler 
using amplitude- and phase-controlled destructive interference. 


(Fig. 2i) or presence (Fig. 2f) of the absorption pulse, the absorption 
efficiency is determined to reach 98.1% + 0.1%. 

The results of master-equation simulations of the excitation transfer 
(solid lines in Fig. 2), using parameters extracted from independent 
measurements (see Methods), display excellent agreement with the 
measured data. This demonstrates a high level of control over the 
emission and absorption processes and an accurate understanding of 
the experimental imperfections dominated by qutrit decoherence and 
photon loss. 

We demonstrate the use of our protocol to transfer deterministically 
an arbitrary qubit state from node A over a distance of about 0.9 m 
along a coaxial line to node B. This is realized by preparing the receiving 
transmon (B) in state |g), applying a R %; pulse to the sending transmon 
(A), followed by the emission or absorption pulse and finally a rotation 

7, on transmon B. We characterize the quantum state transfer by 
reconstructing its process matrix y with quantum process tomography 
(Fig. 3b). For that purpose, we prepare all six mutually unbiased qubit 
basis states** at node A, transfer them to node B, and reconstruct the 
transferred state using quantum state tomography (see Methods). We 
determine a process fidelity of f= tr(X Xideal) = 80.02% + 0.07%, well 
above the limit of 1/2 that can be achieved using local gates and classical 
communication only. The process matrix im calculated with the mas- 
ter-equation simulations agrees very well with the data (absolute values 
shown as red outlines in Fig. 3b). This is supported by the small trace 
distance’? tr|y — Xsim|/2 =0.015, which ideally is 0 for identical process 
matrices and 1 for orthogonal ones. 
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Fig. 2 | Emission, transfer and absorption of a single photon. a, d, The 
transmons at node B (a) and node A (d) are prepared in the state |f) using 
Gaussian derivative removal by adiabatic gate (DRAG) microwave pulses 
RES" and Riz. b, e, We characterize (filled circles) the time dependence 
(7) of the qutrit populations P, ..¢ while driving the |f, 0) < |g, 1) 
transition. The phase (white—blue shading) of the [f, 0) < |g, 1) transition 
drive is modulated to compensate the drive-induced quadratic a.c. Stark 
shift. c, f, The mean field amplitude squared | (a,,,(7))|° of the travelling 
photons emitted from node B (c) and node A (f) is obtained for the photon 
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Fig. 3 | Quantum state transfer. a, Pulse scheme used to characterize the 
qubit state transfer between the two nodes. We prepare six mutually 
unbiased input states with rotations RO, *Rt/ a ‘Ror! ay, RY 2.y igh ? and 
“Rye at node A (denoted by oe where C is the rotation axis). b, We 
experimentally obtain a process matrix (absolute value || shown as 
coloured bars in the basis of the Pauli matrices I, X =o,, Y = io, and 
Z=0,) witha fidelity of F, = 80.02% + 0.07% relative to the ideal identity 
operation. The grey and red outlines show the ideal value and the master- 
equation simulation of the absolute values of the process matrix, 
respectively. The trace distance between the measurement and the 
simulation is 0.015. 
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state (|0) + |1) )/ V2 that is emitted by preparing each transmon in 

(lg) + Uf) )/ V2. (REE=*/ ? in aand d). The effective photon bandwidths 
are adjusted to be xey¢/(27) = 10.4 MHz and«5-/(2n) =10.6 MHz. The 
solid lines in b, ¢, e, f, h and i are results of master-equation simulations 
(see text for details). g, Excitation transfer protocol. h, The time 
dependence of Py. when executing the excitation transfer protocol from 
qubit A to qubit B with RP". i, The residual |(a,,,(7)) |? (ight yellow, 
multiplied by 50) during the absorption process qe ? ing). 


Furthermore, we use the excitation transfer to generate two-qubit 
remote-entangled states between nodes A and B deterministically. The 
protocol starts by preparing transmons A and B in states 
(je) + |f)) /-/2 and |g) , respectively, and then applying the emission 
or absorption pulses followed by a rotation R?; on transmon B to gen- 
erate the entangled Bell state |7*) = (\e,g) + |g,e)) /-/2 . Because 
leakage to the |f) level at both nodes leads to errors in the two-qubit 
density matrix reconstruction, we extract the full two-qutrit density 
matrix $393 from quantum state tomography experiments 
(see Methods). For illustration purposes, we display the two-qubit den- 
sity matrix p,, (Fig. 4b, c), which consists of the two-qubit elements of 
P33. We find a state fidelity of Fs i, = (Y"|2,|07) =78.9% £0.1% 


compared to the ideal Bell state, and a concurrence 
C(p,,) =0.747 + 0.004 (see Methods for a detailed discussion). The 
density matrix p,m calculated from the master-equation simulations of 
the entanglement protocol (red outlines in Fig. 4) is in excellent agree- 
ment with the experimental results, displaying a small trace distance 
of tr|Om — Psim|/2 = 0.024. We decompose the infidelity into approxi- 
mately 10.5% from photon loss, 9% from finite transmon coherence 
times and 2% from pulse truncation. 

Using transmons with relaxation and coherence times of 
Tige = Trge= 30 js and Ty¢= Te¢=20 |1s, and with an achievable 12% 
loss between the nodes, we calculate that our protocol would enable 
deterministic generation of remote-entangled states with a fidelity of 
93%. In addition, we expect our protocol to be extendable to quantum 
network applications to generate deterministic heralded remote entan- 
glement’, using the three-level structure of the transmons and encoding 
quantum information in different time bins to detect photon loss. These 
perspectives indicate that the approach demonstrated here may serve as 
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Fig. 4 | Remote-entanglement generation. a, Pulse scheme to generate 
deterministic remote entanglement between nodes A and B. b, Expectation 
values of two-qubit Pauli operators (a,0,) . The coloured bars indicate the 
measurement results; the ideal expectation values for the Bell state 

Iw") = (leg) + lg, e))//2 and the results of a master-equation 
simulation are shown as grey and red outlines, respectively. We calculate a 
fidelity of Fs, =78.9% +0.1%, which is well explained by photon loss 
and decoherthee. c, Reconstructed density matrix p,, after execution of the 
remote-entanglement protocol. 


the basis for distributed quantum computation in the circuit quantum 
electrodynamic architecture using distinct cryogenic nodes. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
018-0195-y. 
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METHODS 


Literature overview. In Extended Data Fig. 1, we present a short overview of 
remote-entanglement experiments performed using the systems and schemes listed 
in the caption. We calculate bounds™ on the concurrence C for papers in which a 
CHSH-Bell correlation S was specified but no value for C was given, to provide a 
more complete comparison. However, we do not calculate C for papers that provide 
only a Bell-state fidelity because knowledge of the elements of the density matrix 
are necessary to determine C without further assumptions. 

In addition to the work described here, two independently performed experi- 
ments”>* concurrently realized deterministic remote state transfer and remote- 
entanglement generation with superconducting circuits along the lines of a 
previous proposal’. In contrast to our work, the other experiments”? use 
three-dimensional cavities and transmon qubits with superior coherence prop- 
erties instead of planar two-dimensional systems. They also use radiation fields 
with a Gaussian profile and a duration of 2-4 \1s for state transfer and remote- 
entanglement generation—substantially longer than used here—which made use 
of dedicated Purcell filters to increase the emission and absorption bandwidth of 
the fields used for state transfer. As a result, the concurrence and fidelity found in 
our experiments exceed those found in the experiments in refs 7°74 at comparable 
absorption efficiencies and despite the inferior coherence times. Going beyond our 
work and that presented in ref. 4, ref. presents transfer of multi-photon states and 
discusses the potential of the scheme for implementing error correction at each site. 

We also note a measurement-based probabilistic realization of remote entangle- 

ment in superconducting circuits** that improves on previous work" and a recent 
experiment with nitrogen-vacancy centres” using single-photon interference and 
detection to guarantee deterministic delivery of entangled states at a specified time. 
Sample parameters. The designs are very similar to those used previously”’”, with 
only minor parameter modifications. The /4 coplanar waveguide resonators and 
feed lines are created from etched niobium on a sapphire substrate using standard 
photolithography techniques (Extended Data Fig. 2a) We define the transmon 
capacitor pads and junctions with electron-beam lithography and shadow- 
evaporated aluminium with lift-off. We extract the parameters of the readout cir- 
cuit (grey elements, Fig. 1b) and transfer circuit (yellow elements, Fig. 1b), as well 
as the coupling strength of the transmon to these circuits, from fits to the trans- 
mission spectra of the respective Purcell filter when the transmon is prepared in 
its ground or excited state using a technique and model discussed previously?”*”°*., 
We obtained four working samples from two fabrication runs, with a standard 
deviation of approximately 8 MHz in the frequency 1 of the transfer resonators, 
and used the pair with the best-matching frequencies. We then tuned the transfer 
resonators into resonance (Avy + 0.2 MHz) using the dependence of the resonator 
dispersive shift*? on the transmon-resonator detuning 6= Vge — Vy. To tune the 
transmon frequencies we use a miniature superconducting coil to thread flux 
through the superconducting quantum-interference device (SQUID) at each node. 
Furthermore, the anharmonicity a and the coherence times es and T%, of the 
qutrits are determined using Ramsey-type measurements. We obtain T} ef Ti ge/3 
for the energy decay times Tg and T} es of both transmons, which is lower than the 
expected” T},./2. The excess decay rate may be caused by the more complicated 
environmental mode structure presented to our transmons due to the set of two 
resonators with their respective Purcell filters coupled to it. All relevant device 
parameters are listed in Extended Data Table 1. 
Microwave drive schemes. We use resonant Gaussian DRA microwave pulses 
of length 19.8 ns and 16.8 ns for R7, and R¢ to swap populations between the |g) 
and |e) states and the |e) and if) states, respectively. We extract an averaged 
Clifford-gate fidelity for the |g) and |e) pulses of more than 99.2% for both trans- 
mon qubits, from randomized benchmarking experiments®’. 

We induce the effective coupling ¢ between states |f,0) and |g, 1) by applying 
a microwave tone with drive amplitude < to the transmon, at the resonance fre- 
quency of the transition Cm = 4.022 GHz and Viogi = 3.485 GHz). Following a 
procedure described previously”, we calibrate the a.c. Stark shift of the transmon 
levels induced by the |f,0) <> |g, 1) drive, and extract the linear relation between 
the drive amplitude ¢ and the effective coupling ¢ (Extended Data Fig. 3). To 
remain in resonance with the driven transition, we adjust the phase of ¢ on the 
basis of the measured as Stark shift. We calibrate the drive to reach maximum 
effective couplings of ¢“/(2n) =6.0 MHz and ¢°/(2n) =6.7 MHz (Extended 
Data Fig. 3b), eer fat 

We generate photons with temporal shape #(t) = > a[ Fest Sch (Kept /2) by res- 
onantly driving the |f,0) < |g, 1) transition with 


G* 1,42 
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where ky is the bandwidth of the transfer resonator and Ket < Ky is determined by 
the strength and duration of the transfer pulse. The dynamics are well described by 
a two-level model with loss, captured by the non-Hermitian Hamiltonian 
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where " is the complex conjugate of &. This Hamiltonian acts on states |f,0) and 
|g, 1), analysed in a rotating frame. The non-Hermitian term —ik7/2 accounts for 
photon emission, which brings the system to the dark state |g, 0) . It can be shown 
that using the effective coupling of equation (1) in the Hamiltonian in equation (2) 
leads to the emission of a single photon with the desired temporal shape. This 
analytical solution provides the option of adjusting the effective bandwidth Keg of 
the emitted photon and of generating photon shapes with exponential falling and 
rising edges at rate Keg. In all experiments, we create photons with the maximum 
bandwidth achievable in our set-up, limited by kr ofnode A (ep < Ky, ag*, 2g). 
Ko! = Keg, fesults in a symmetric amplitude and phase profile of the [f,0) <— |g, 1) 
transfer pulse at node A and K > Keg in an asymmetric drive shape at 
node B (equation (1)). For the absorption process of the photon we time-reverse 
the |f,0) < |g,1) drive at node B (Fig. 2g). The photon shape with the 
shortest pulse duration would require an exponential rising edge proportional 
to the bandwidth of the receiving node («-;) and a falling edge proportional 
to the bandwidth of the emitting node («;"). In our experiment, we create 
photons with a symmetric shape, approximately realizing the shortest photon 
duration. 

An alternative protocol to generate a remote-entangled state involves preparing 

the transmon at node A in |f, 0), swapping half of the population to |g, 1) (Rn, ) 
and using the same |g, 1) + [f, 0) absorption pulse at node B as actually realized 
in our experiment (Fig. 4a). Rea can be used to decrease the emission time. 
However, the absorption process requires the same time. Therefore, in our reali- 
zation, there is no advantage to using this modified protocol. 
Three-level single-shot readout. The state of transmon A (B) is read out with a 
gated microwave tone applied to the input port of the readout resonator Purcell 
filter at frequency vj = 4.778 GHz (vf =4.765 GHz). As depicted in Fig. 1b, the 
output signal is routed through a set of two circulators and a combiner and then 
amplified at 10 mK with 22 dB (19.3 dB) gain using a Josephson parametric ampli- 
fier (JPA). The JPA pump tone is detuned by 2 MHz from the measurement signal 
and has a bandwidth of 18.3 MHz (32 MHz). Using these JPAs we find a 
phase-preserving detection efficiency of 7=0.61 (7) = 0.60) for the full detection 
line. The signal is then further amplified by a high-electron-mobility transistor 
(HEMT) at 4 K and two low-noise amplifiers at room temperature. Subsequently, 
the signal is down-converted to 250 MHz using an analogue mixer, lowpass- 
filtered, digitized by an analogue-to-digital converter and processed by a field- 
programmable gate array (FPGA). Within the FPGA, the data are digitally 
down-converted to d.c. and the corresponding I and Q quadrature values are 
recorded during a window of 256 ns in 8-ns time steps. The FPGA trigger is timed 
so that the integration window starts with the rising edge of the measurement 
pulse. We refer to a recording of the I and Q quadrature of a measurement pulse 
as a readout trace, S(t). 

We prepare the transmon in states |g), |e) and [f) 25,000 times each and record 
the single-shot traces. Each trace is then integrated in post-processing, with two 
weight functions, wj(¢) and w(t), to obtain the integrated quadratures 


u= [Swat and v= [S(t)w,(t)dt 


The collected and integrated traces form three Gaussian-shaped clusters in the 
u-v plane (Extended Data Fig. 4), which correspond to the Gaussian probability 
distributions of the trace when the qutrit is prepared in one of the three eigen- 
states. We model the probability distribution x = (u, v) as a sum of three Gaussian 
distributions, with density 
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and estimate the parameters A,, jz; and ¥. On the basis of these parameters, we 
divide the u—v plane into the three regions used to assign the result of the readout 
of the qutrit state (Extended Data Fig. 4). If an integrated trace is in the region 
labelled s’, then we assign it state s’. By counting the number of traces prepared in 
state |s) and assigned the value s’, we estimate the assignment probabilities 
R,,, = P(s'||s)) (Extended Data Fig. 4). We optimize the measurement power and 
signal integration time to minimize the measurement error probability || — R||,/6, 
that is, the sum of the off-diagonal elements of the assignment probability matrix 
(Extended Data Table 2) divided by the number of preparation states. The minimal 
measurement error probability is realized with an integration time of f;} = 112 ns 
(t8 =216 ns) and a measurement power that results in a state-dependent photon 
number in the readout resonator between 0.1 and 2. The probabilities of correct 
assignment range from 93% to 98% for both qutrits (diagonal elements of Extended 
Data Table 2). 
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The probability My to assign value s’ to a single-shot measurement of a qutrit 
in state p is 


My = P(s'| p) = > POs'IIs))2, 


which can be expressed as M = Rpdiag Where Pgiag is a vector consisting of the diag- 
onal elements of p. The assignment probabilities M are typically estimated from 
assignment counts and a first approach to estimate Pgiag is to equate it to M. This 
approach is sensitive to measurement errors, but insensitive to state preparation 
errors. Setting Paiag=R~'M effectively accounts for the effect of single-shot readout 
error. However, this approach relies on the ability to estimate R precisely and is 
therefore sensitive to state-preparation error. With transmon reset infidelities of 
approximately 0.2%* and single-qubit gate errors of 0.8% (measured with rand- 
omized benchmarking), state-preparation errors are expected to be lower than 
readout errors. For this reason, we chose to use the latter approach. 
We note that the assignment probability matrix 

Ressasisg =P (shSpllsasp)) = P(salls,))P (spllsp)) can be obtained as the outer 
product of the single-qutrit assignment probability matrices (compiled in Extended 
Data Table 3) and that we can extend this formalism to correct for single-shot 
readout errors and extract the state populations of a two-qutrit system. 
Estimate of photon loss. We determine the photon loss J, between node A and 
node B by emitting a photon in the coherent state (|0) + |1)) /./2 first from node 
A and then from node B independently, as discussed in the main text. Making use 
of the circulator between the two nodes, we detect the field of each of the emitted 
photons in the same detection line (Fig. 1). The path travelled by the two emitted 
photons towards the detector differs only by the length of the waveguide separating 
the two samples from each other. We evaluate the ratio of the integrated power of 
the detected fields 


\| dt 
t))|° de 


FMasul))" at Kena 
fi \(a, Aanel 


to extract the photon loss /,3 between node A and node B. In addition, we estimate 
the photon loss between node A and B from the specifications of the individual 
elements connecting the nodes: two printed circuit boards, including connectors 
(each 2.5% + 1%), two coaxial cables of length 0.4 m (each 4.0% + 0.1%) anda 
microwave circulator (13% + 2% according to manufacturer). With these para- 
meters we estimate an accumulated photon loss between node A and node B of 
24% + 3%, in good agreement with the measured value of [4g = 23.0% + 0.5%. 
Master-equation simulation. We model the transmons as anharmonic oscillators 
with annihilation (creation) operators b, 6, )°, where the subscript i € {A, B} 
denotes the emitter and receiver samples, respectively. The transfer resonator anni- 
hilation (creation) operators are denoted as 4; (4;'). Setting h = 1, the driven 
Jaynes-Cummings Hamiltonian for sample i is 


= wide, + wb; 6, + ious +6) 
ee distin x (3) 
+g1(4;'b; + a;b; )— Fes, by "bi 


where g' denotes the coupling between the transmon and the transfer resonator, 
Ej, denotes the charging energy of the transmon and 2(t) = 2' cos[wjt + y'(t)] 
is the amplitude of the microwave drive that induces the desired coupling ¢(t). 
Because the readout resonators do not play a part in the photon transfer dynamics, 
they are omitted from the Hamiltonian; the static Lamb shifts that they induce are 
implicitly included in the parameters. 

To make the effective coupling ¢(t) between the |f, 0) and |g, 1) states apparent 
and to simplify the simulations, we perform a series of unitary transformations on 
equation (3). We first move to a frame rotating at the drive frequency wj, and then 
perform a displacement transformation b, > b,—',4, > 4,—7’, choosing (3 and 
7 so that the amplitude of the linear drive terms is set {8 zero. Next, we perform a 
Bogoliubov transformation b, = Kee ‘\b; —sin(A' ad; a, — cos( A’ \d; +sin(A’)b, 
where tan(2.A') = —2g.. /( wh yet 2E¢ [G'l). Neglecting small off-resonant 
terms, we obtain the resulting effet tive Hamiltonian 


gi int, pte  Optpte K' atete a 
Ay= Ay4a;'a;+ Ave ; 6, + —o b; bb, + 3 a; ad; 
(4) 
intapte Lo olatet. | anges 
42X94; G;b; bj + oy 8h b; 4; + gd; b;b;) 
where a’ = —E{cos' A is the transmon abhabmoninily, K'= —Efsin’ A’ is the 


qubit-induced resonator anharmonicity, \. = = — Ecos" A'sin® Al is the dispersive 
shift, At = wpcos” Ai + (Wye 2Ec |8'| )sin? A'— ~g, sin 2A'—wj_ is the resona- 
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tor—drive detuning and Aj. = (wy.—-2E¢ \a'| 5 cos” A! + wesin® A+ St sin 2A'—w} 
is the qubit-drive detuning. In equation (4), the desired effective coupling 
z — —Eji,3'/2 cos” Aisin A’ between the |f,0) and |g,1) states is now made 
explicit. 

Finally, moving to a frame rotating at A. for the resonator and Pi +a’ /2 for 
the transmon qubits, the combined effective Hamiltonian of the two samples is 


1 wapipt 
+= [2'(Nb; b; 4; 
ae) [g'(t) 
a. 
—i (43-44) 


OWA nba} 


where 7, is the photon-loss probability of the circulator between the two samples. 
Using this effective Hamiltonian, numerical results are obtained by integrating 
the master equation 


— [Herp 
+Kp(— 7.) \Dlalo+ Di frin. ay +. Kp asl p 
+ SO thin Pl@]p + r44¢P Ug) (el Jo 

(5) 
+%erP He) 1) A} 
+ DO {geP Ue) Cel lg) (al 0 

i=A,B 

Ver UF) (Alle) Cel JA} 

where D[O]+ = 0-0 = O 1 Ove } /2denotes the dissipation super-operator, Kyi, the 


internal decay rates of the resonators, y,,, = 1/T; nm the decay rates of the a 
mon qubits between the |) , and |m) , states and Yon =1/(2T) im) —1/ (Tom) 
the dephasing rates between the |7) , and |m), states of the transmon qubits. The 
last term in H,, combined with the resonator dissipators i in the second line of the 
master equation (equation (5)), assure that the output of the emitter A is cascaded 
to the input of the receiver B*** through a circulator with photon loss 7. 
Quantum state and process tomography. Quantum state tomography of a single 
qutrit is performed by measuring the qutrit state population with the single-shot 
readout method, after applying the following tomography gates: Woes gel Ss 

Ree RE, ORY PRY CRE RY), (RE ORT) and ("RE RY). The 
elements of the density ae are then cecomsenced using a maximum-likelihood 
method, assuming ideal tomography gates. 

To extend this quantum state tomography procedure to two-qutrit density 
matrices, we perform two local tomography gates (from the 81 pairs of gates 
that can be formed from the single-qutrit quantum state tomography gates) on 
transmons A and B, before extracting the state populations using the two-qutrit 
single-shot measurement method. 

To characterize the qubit state transfer from node A to node B, we perform full 
quantum process tomography” (Fig. 3, Extended Data Table 4). We prepare each 
of the six mutually unbiased qubit basis states** |g), |e), (|g) + le))/V2; 
(Ig) +i le)) //2, (lg) —le)) /./2 and (|g) —i |e)) /./2, transfer the state to node 
B, then independently measure the qutrit density matrix at node A and node B 
with quantum state tomography. We obtain the process matrix from these density 
matrices through linear inversion. Quantum state tomography of the qutrit sub- 
space is required to characterize the residual population in |f) after the qubit state 
transfer, which is caused mainly by decay from the [f) level in combination with 
Rér swapping |e) with [f) populations. The density matrices that we obtain have 
anon-unit trace in the qubit subspace, and so does the qubit state transfer process 
matrix. Consequences of that observation are discussed below for the example of 
the Bell-state density matrix. 

Two-qutrit entanglement. Owing to a residual population of 3.5% of the |f) level 
of the transmons after the entanglement protocol, the entangled state cannot be 
described rigorously by a two-qubit density matrix. To be concise, we represent 
the reconstructed two-qutrit entangled state 33 (Extended Data Fig. 5, Extended 
Data Table 5) by a two-qubit density matrix p, that consists of the two-qubit ele- 
ments of £323. This choice of reduction from a two-qutrit to a two-qubit density 


+ + + +). 
=" Lal Y") =" Iesa3l Ys how- 
ever, Pm has a non-unit trace. In addition, this reduction method gives a 


matrix conserves the state fidelity F ve 
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conservative estimate of the concurrence C(p,) compared to a projection of p3q3 
on the set of physical two-qubit density matrices. 

To verify the three-level bipartite entanglement, we use the computable cross- 
norm or realignment criterion*’, which is well defined for multi-level mixed entan- 
gled states. This criterion states that a state p must be entangled if >, A, > 1, with 
p= dA ,Ge ® Gp and G) an orthonormal basis of the observable spaces of 
HA®. We obtain >, Ag= 1.612 + 0.003 with the measured entangled state 323, 
providing unambiguous evidence for the existence of entanglement of the prepared 
state. 

Data availability. The data that support the findings of this study are available 
within the paper. 
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Extended Data Fig. 1 | Overview of remote-entanglement experiments. 
a, Entanglement generation rate I’... b, CHSH-Bell inequality*?*° 
correlation S. c, Concurrence C. d, Entangled state fidelity F*. The 
experiments are grouped by physical system: atomic ensembles 
(‘ae’)!221-51-53, trapped ions (‘ion’)!*!7°4-%S, single-atom Bose-Einstein 
condensate (‘sab’)°’, vibrational state of diamonds (‘vs’)!8, rare-Earth- 
doped crystals (‘rec’)°, single atoms (‘sa’)'®*, nitrogen—vacancy centres 
(‘nv’)?03659.0, superconducting circuits (‘sc’) 4792456! or quantum dots 
(‘qd’)!®°?. The colours indicate different implementations: probabilistic 
unheralded (red), probabilistic heralded (blue), guaranteeing a 
deterministic delivery of an entangled state at a pre-specified time (yellow) 
or fully deterministic (green). The plot markers indicate different schemes 
for realizing the remote interaction: measurement-induced (triangles), 
single-photon (crosses) or two-photon (squares) interference and 
detection, direct transfer (diamond) or direct transfer with shaped photons 
(circles). The lines in c are bounds* on the concurrence C calculated from 
measured CHSH-Bell correlations S. The shaded column highlights this 
study. 
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Extended Data Fig. 2 | Micrograph of sample and energy-level diagram. 
a, False-colour micrograph of a sample of the same design co-fabricated 
with the one used for node A. The circuit elements are colour coded as in 
Fig. 1: transfer circuit (yellow), readout circuit (grey), transmon (orange) 
and input lines of the transmon and readout circuit (blue). The input 

to the transfer circuit is used as an auxiliary port to perform resonator 
spectroscopy in transmission. The inset shows a scanning electron 
microscopy (SEM) micrograph of the asymmetric SQUID with a ratio of 
5:1 between the areas of the Josephson junctions used in the transmon. 

b, Schematic of the energy-level diagram of the coupled transmon-transfer 
resonator system. The numerical values of all parameters are listed in 
Extended Data Table 1. 
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eff. coupling, 9/27 [MHz] 
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drive amplitude, €f9g1 [norm.] 


Extended Data Fig. 3 | a.c. Stark shift and Rabi rate of the |f,0) < |g, 1) 
transition. a, b, Measurement (filled circles) of the a.c. Stark shift Ajog:/ 
(27) (a) and the effective coupling ¢/(27) of the |f,0) < |g, 1) transition 
(b) versus drive amplitude €j,; for sample A (blue) and B (red). The solid 
lines in a (b) are quadratic (linear) fits to the data*”. 
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integrated qudarature, v [arb. units] 


integrated qudarature, v [arb. units] 


-2. -15 -1. -05 0. 0.5 -2. -15 -1. -05 0. 05-2. -15 -1. -05 0. 0.5 
integrated qudarature, u [arb. units] integrated qudarature, u [arb. units] integrated qudarature, u [arb. units] 


Extended Data Fig. 4 | Qutrit single-shot readout characterization. repetitions for each state-preparation experiment. The dashed lines are the 
a-f, Scatter plot of the measured integrated quadrature values u and v for qutrit-state discrimination thresholds used to obtain the assignment 
qutrit A (a-c) and B (d-f) when prepared in state |g) (blue; a, d), |e) (red; probabilities (numbers, which are also listed in Extended Data Table 2). 

b, e) and |f) (green; c, f). We plot only the first 1,000 of the 25,000 
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Extended Data Fig. 5 | Characterization of entangled states in a the identity operation, ,,2,3 denote the Pauli matrices 0, , in the qubit 
two-qutrit basis. a~c, Two-qutrit state tomography: real (a) andimaginary subspace, 4,5 correspond to os , Ae,7 correspond to ot and Ag is the 
(b) part of the density matrix and expectation values of the Gell-Mann diagonal matrix (o2° + 20%") / V3. The trace distance between the 


operators x (c). The ideal Bell state |y*) and numerical master-equation measurement and the simulation is 0.107. 
simulation are depicted as grey and red outlines, respectively. A) denotes 
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Extended Data Table 1 | Summary of device parameters for nodes A and B 


quantity, symbol (unit) 


Node A Node B 


readout resonator frequency, vp (GHz) 
readout Purcell filter frequency, vpps (GHz) 
readout resonator bandwidth, «r/2m (MHz) 
readout circuit dispersive shift, yr/27 (MHz) 
transfer resonator frequency, vr (GHz) 8.4005 
transfer Purcell filter frequency, vpp¢ (GHz) 8.426 
transfer resonator bandwidth, hr /27 (MHz) 
transfer circuit dispersive shift, x7 /27a (MHz) 
qubit transition frequency, Vg. (GHz) 

transmon anharmonicity, a (MHz) 

|f,0) + |g, 1) transition frequency, Vyog1 (GHz) 
|f,0) + |g, 1) max. eff. coupling, Gm/27 (MHz) 
energy relaxation time on ge, Tige (jus) 

energy relaxation time on ef, Tier (us) 
coherence time on ge, ice (us) 


coherence time on ef, TR, (us) 
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Extended Data Table 2 | Probabilities of identifying the prepared 
states (columns) as the measured states (rows) for qutrits A and B 


Qutrit A Qutrit B 


ls) le) If 
g|98.2 5.0 1.3]/98.5 3.9 1.2/g 


e] 1.0 93.3 4.8]|0.9 93.5 6.1 Je 


f} 0.8 1.7 94.0]| 0.6 2.5 92.7/f 


The diagonal elements show correct identification; the off-diagonal elements show 


misidentifications. 
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Extended Data Table 3 | Probabilities of identifying the prepared 
input states (columns) as the indicated output states (rows) for all 
possible tuples of two-qutrit basis states 


/ |lee) Ise) lef) lee) lee) lef) Ife) Ife) |e) 


e¢|96.8 3.9 11 4.9 0.2 0.1 12 0.0 0.0 


0.9 91.9 6.0 0.0 4.7 0.3 0.0 1.2 0.1 
0.6 2.5 91.1 0.0 0.1 46 0.0 0.0 1.2 
1.0 0.0 0.0 91.9 3.7 1.1 4.7 0.2 0.1 
0.0 0.9 0.1 0.8 87.3 5.7 0.0 4.5 0.3 
0.0 0.0 0.9 0.6 2.4 86.5 0.0 0.1 4.4 
0.8 0.0 0.0 1.6 0.1 0.0 92.5 3.7 1.1 
0.0 0.7 0.0 0.0 1.6 0.1 0.8 87.9 5.8 


0.0 0.0 0.7 0.0 0.0 1.6 0.6 2.4 87.1 


The diagonal elements show correct identification; the off-diagonal elements show 
misidentifications. 
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Extended Data Table 4 | Numerical values of the experimentally obtained process-matrix elements of the qubit state transfer 


I 0.8002 0.0008-0.00212 -0.0036-0.00332 0.0818-0.00382 


X| 0.0008+0.00212 0.0748 -0.0734 0.0017+0.0037% 


Y|-0.0036-+0.00332 -0.0734 0.0727 0.0011+0.0028:2 


Z| 0.0818+0.00382 0.0017-0.00377 0.0011-0.00282 0.0313 


The absolute value of this process matrix is depicted in Fig. 3 as coloured bars. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


Extended Data Table 5 | Numerical values of the experimentally obtained density-matrix elements of the two-qubit remote-entangled state 
in a two-qutrit basis 


gg ge gf eg ee ef fg fe ff 
gg 0.142 -0.001 0.002-0.005i 0.001-0.001i 0.001-0.001% 0.001 -0.006-0.0042 0.016-0.027: 0.001+0.0017 
ge -0.001 0.343 -0.008-0.021% 0.378 0.003++-0.0022 -0.003-0.0042 0.003+-0.0062 -0.005-0.002% 0.001 
gf | 0.002+0.0057 -0.008-+0.021% 0.015 -0.004 -0.002 -0.001-0.002: 0.005-0.002i 0.002-0.001% 0. 
eg| 0.001+0.0017 0.378 -0.004 0.48 0.002-0.0012 0.001-0.01% 0.013+40.0297 -0.002-0.001% 0.002 
ee] 0.001+0.0012 0.003-0.002% -0.002 0.002+0.001% 0. 0. -0.001 0. 0. 
ef 0.001 -0.003-++0.0047 -0.001+0.002i 0.001+0.01% 0. 0.001 -0.001+-0.0022 0. 0 
fg |-0.006+0.0047 0.003-0.0062 0.005+0.002i 0.013-0.029% -0.001 -0.001-0.002 0.012 0.001-+0.001% 0. 
fe | 0.016+0.0277 -0.005+0.002i 0.002+0.0017 -0.002+0.001i 0. 0. 0.001-0.001% 0.007 0 
ff | 0.001-0.001% 0.001 0. 0.002 0. 0. 0. 0. 0 
The real and imaginary parts of this density matrix are depicted as coloured bars in Extended Data Fig. 5a and b, respectively. 
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Deterministic delivery of remote entanglement ona 


quantum network 


Peter C. Humphreys! 
Matthew Markham? & Ronald Hanson" 


Large-scale quantum networks promise to enable secure 
communication, distributed quantum computing, enhanced 
sensing and fundamental tests of quantum mechanics through 
the distribution of entanglement across nodes’. Moving beyond 
current two-node networks*° requires the rate of entanglement 
generation between nodes to exceed the decoherence (loss) rate of 
the entanglement. If this criterion is met, intrinsically probabilistic 
entangling protocols can be used to provide deterministic remote 
entanglement at pre-specified times. Here we demonstrate this using 
diamond spin qubit nodes separated by two metres. We realize a 
fully heralded single-photon entanglement protocol that achieves 
entangling rates of up to 39 hertz, three orders of magnitude 
higher than previously demonstrated two-photon protocols on this 
platform". At the same time, we suppress the decoherence rate of 
remote-entangled states to five hertz through dynamical decoupling. 
By combining these results with efficient charge-state control and 
mitigation of spectral diffusion, we deterministically deliver a fresh 
remote state with an average entanglement fidelity of more than 0.5 
at every clock cycle of about 100 milliseconds without any pre- or 
post-selection. These results demonstrate a key building block for 
extended quantum networks and open the door to entanglement 
distribution across multiple remote nodes. 

The power of future quantum networks will derive from entangle- 
ment that is shared between the network nodes. Two critical parameters 
for the performance of such networks are the entanglement-generation 
rate rent between nodes and the entangled-state decoherence rate 
Tdec- Their ratio Mink = Tent/Tdecs Which we term the quantum link effi- 
ciency®'*, quantifies how effectively entangled states can be preserved 
over the timescales necessary to generate them. Alternatively, the link 
efficiency determines the average number of entangled states that can 
be created within one entangled-state lifetime. A link efficiency of unity 
therefore represents a critical threshold above which entanglement can 
be generated faster than it is lost. Exceeding this threshold is central to 
allowing multiple entangled links to be created and maintained simul- 
taneously, as is required for the distribution of many-body quantum 
states across a network®!, 

Consider an elementary entanglement-delivery protocol that delivers 
states at pre-determined times. This can be achieved by making 
multiple attempts to generate entanglement and then protecting 
successfully generated entangled states from decoherence until the 
required delivery time (Fig. la, steps (1)-(3)). If we try to generate 
entanglement for a period fent then the cumulative probability of 
success will be p |. =1—e ' ‘ent, For a given Psucc, the average fidelity 
Fouce with respect toa maximally entangled state of the successfully 
generated states is solely determined by the quantum link efficiency 
Mink (Methods). We plot Feucc Versus Psucc for several values of Mink in 
Fig. 1b. 

This protocol allows entangled states to be delivered at specified 
times, but with a finite probability of success. By delivering an 
unentangled state (state fidelity Funent < 1/2) in cycles in which all 
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entanglement-generation attempts failed, the protocol can be cast into 
a fully deterministic black box (Fig. 1a, step (4)). The states output from 
such a black box will have a fidelity of 

Fue = +(1 Pesce DE aitet ( 1) 


Parcel suce + 


The maximum achievable fidelity Fjv;" of this deterministic state- 


delivery protocol, found by optimizing Puc) is also determined only by 
the quantum link efficiency mink. For Funent = 1/4 (a fully mixed state), 
we find (Fig. 1c) 
max 1 
F det — 7 [ 


1/(1-7, 

1437/0 tw] (2) 
For Mink greater than about 0.83, there exists a combination of Psucc and 
Fsucc high enough to compensate for cycles in which entanglement is 
not heralded, enabling the deterministic delivery of states that are 
entangled on average (Fyz;* > 1/2) . Deterministic entanglement 
delivery is therefore a critical benchmark of the performance of a 
network, certifying that its quantum link efficiency is of order unity or 
higher. Furthermore, the ability to specify in advance the time at which 
entangled states are delivered may assist in designing multi-step 
quantum-information tasks such as entanglement routing!®””. 

However, so far, quantum link efficiencies of order unity or greater 
have remained out of reach for solid-state quantum networks. Quantum 
dots have been used to demonstrate kilohertz entanglement rates rent, 
but decoherence rates rg.. of tens of megahertz limit their quantum link 
efficiencies'*” 7:4, to around 107+. Nitrogen-vacancy (NV) centres— 
point defects in diamond with a long-lived electron spin and bright 
optical transitions—have been used to demonstrate entanglement rates 
Tent of tens of millihertz!*" and, in separate experiments, decoherence 
rates rdec of the order of one hertz", which together would give link 
efficiencies Mink of roughly 107. 

Here we achieve quantum link efficiencies jn, well in excess of 
unity by realizing an alternative entanglement protocol for NV centres 
in which we directly use the state heralded by the detection of a single 
photon (Fig. 2)?4?”. The rate for such a single-photon protocol scales 
linearly with losses, which, in comparison with previously used 
two-photon-mediated protocols”"4, provides a substantial advantage in 
typical remote-entanglement settings. Recent experiments have high- 
lighted the potential of single-photon protocols by generating local 
entanglement”*4, and remote entanglement in post-selection'®”. By 
realizing a single-photon protocol in a fully heralded fashion and pro- 
tecting entanglement through dynamical decoupling, we achieve the 
deterministic delivery of remote-entangled states on an approximately 
10-Hz clock. 

Our experiment uses NV centres that reside in independently oper- 
ated cryostat set-ups separated by 2 m (further experimental details are 
given in Methods). We use qubits formed by two of the ground-state 
spin sublevels of the NV centre (|1) = |m,=0) and ||) =|m,=—1), 
where m, is the projection of the spin along its quantisation axis). 
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Fig. 1 | Deterministic remote-entanglement delivery. a, Deterministic 
entanglement delivery guarantees the output of states with an average 
entanglement fidelity of more than 0.5 at pre-specified times. In our 
protocol, underlying this deterministic delivery is a probabilistic but 
heralded entanglement process. Repeated entangling attempts (dashed 
helical links) are made (1) and then, upon heralded success (2), the 
entangled state (solid helical link) is protected from decoherence 
(represented by the lock) until the specified delivery time (3). If no attempt 
at entanglement generation succeeds within one cycle, an unentangled 
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Fig. 2 | Benchmarking single-photon entanglement generation. 

a, Experimental protocol. (1) Before entanglement generation, an 
NV-centre state check verifies that the NV centre is in the correct charge 
state (the negatively charged state) and resonant with the excitation laser 
(discussed further in Methods); this state is denoted ‘ready’. This is 
repeated until the check passes. (2) Entanglement generation is attempted 
until success is heralded, in which case we continue to readout (step (3)). 
If 250 attempts have been made without success, we revert to step (1). 

(3) Upon heralded success, the spin states are read out in a chosen basis by 
using microwaves to rotate the state, followed by single-shot readout (light 
bulbs indicate the detection of the bright (1) or dark (0) spin state). b, The 
left panel shows the optical set-up used for entanglement generation, in 
which the optical excitation pulses for each node are derived at a beam 
splitter. The single photons emitted by the nodes as a result of these 
excitation pulses are interfered on another beam splitter, completing an 
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state must be delivered (4). b, For the underlying entanglement-generation 
and state-preservation protocol (steps (1)-(3) in a), the effectiveness 

of the trade-off between the average fidelity of the entangled state that 

is delivered and the success probability is determined by the quantum 

link efficiency mink. The dashed line represents the classical threshold 

of F=0.5, above which a state is entangled. c, Maximum fidelity of 
deterministically delivered states as a function of mink. A critical threshold 
of mink & 0.83 (vertical green line) must be surpassed to deliver an 
entangled state at every cycle on average. 
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effective interferometer between the nodes. The optical phase difference 
A@ acquired in this interferometer must be known. At pre-determined 
intervals, light is injected into the interferometer, as shown in the right 
panel. This light is measured using the same detectors that herald 
entanglement, and the signal is fed back via a microcontroller to a 
piezo-electric fibre stretcher that is used to compensate for phase drifts. 
For the data reported here, we stabilize the phase difference every 180 ms. 
c, Measured (XX) and (YY) correlations (left) for Wo; (where 0/1 denotes 
the heralding detector) and a =0.1 as the readout basis is swept at node A 
(inset). The right panel shows the measured (ZZ) correlations. 

d, e, Fidelity of the heralded states with respect to a Bell state (d) and 
entanglement-generation success rate (e), for different values of a. 

For c-e, solid lines (with shaded 1-s.d. statistical uncertainties) are the 
predictions of our model based solely on independently determined 
parameters (Methods). Error bars in c and d represent 1 s.d. 
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Fig. 3 | Coherence protection of remote-entangled states. a, Dynamical 
decoupling protects the state of the NV-centre spins from quasi-static 
environmental noise. To protect a spin for a time T, N= T/(2t) inverting 
(1m) microwave pulses are applied at 2t intervals. For node A, t= 40.320 1s; 
for node B, t= 36.148 ks. b, Fidelity with respect to the initial state for 
dynamical decoupling of the state (|t) + ||))/J/2 at each of our nodes. 
Solid lines show exponential fits with coherence times of 290(20) ms and 


1 1 1 j 
100 150 200 250 


Decoupling duration (ms) 


Single-photon entanglement generation (Fig. 2a) proceeds by first ini- 
tializing each node in |t) by optical pumping”, followed by coherent 
rotation using a microwave pulse” to create the state 


INV) = -Va|T) + v1—a|J) (3) 


where a is determined by the choice of microwave pulse. We then apply 
resonant laser light to excite selectively the ‘bright state |) to an 
excited state, which rapidly decays radiatively back to the ground state 
by emitting a single photon. This entangles the spin state of the NV 
centre with the presence (|1)) or absence (|0)) of a photon in the 
emitted optical mode: 


|NV, optical mode) = Va|T)|1) + /1—a||)|0) (4) 


Emitted photons are transmitted to a central station at which a beam 
splitter is used to remove their ‘which path information. Successful 
detection of a photon at this station indicates that at least one of the NV 
centres is in the bright state |) and therefore heralds the creation ofa 
spin-spin entangled state. However, given the detection of one photon, 
the conditional probability that the other NV centre is also in the |T) 
state, but that the photon it emitted was lost, is p= ca (in the limit 
Pree <A where Pet is the photon detection efficiency). This degrades 
the heralded state from a maximally entangled Bell state 


Ib) =(ITL) + 11 1))/ V2 to 
Pry ny =C—-a) |) dl + al TTT (5) 


The probability of successfully heralding entanglement is 2paea. The 
state fidelity F= 1 — a can therefore be traded off against the entangle- 
ment rate directly. The corresponding success probability of a 
two-photon protocol is Pi /2; for a given acceptable infidelity a, 
single-photon protocols will therefore provide a rate increase of 
4a/ per. For example, for our system’s paet © 4 x 10~4, if a 10% infidel- 
ity is acceptable, then the rate can be increased by three orders of 
magnitude compared to two-photon protocols. 

The primary challenge in implementing single-photon entangle- 
ment is that the resulting entangled state depends on the optical phase 
acquired by the laser pulses used to create spin-photon entanglement 
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680(70) ms for nodes A and B, respectively. c, Dynamical decoupling of 
entangled states created using the single-photon entanglement protocol for 
a@=0.12 and a= 0.2. Solid lines (with shaded 1-s.d. statistical 
uncertainties) show the predictions of our model (Methods) based on the 
data in b, from which the entangled-state coherence time is expected to be 
7 = 200(10) ms. Error bars in b and c represent 1 s.d. 


at each node, as well as on the phase acquired by the emitted single 
photons as they propagate (Fig. 2b). The experimental set-up therefore 
acts as an interferometer from the point at which the optical pulses are 
split to the point at which the emitted optical modes interfere. For a 
total optical phase difference of AQ, the entangled state created is 


[bo (A8)) = IT) te] 11) (6) 


where 0/1 (with corresponding + phase factor) denotes the detector 
at the central station that detected the incident photon. This optical 
phase difference must be known to ensure that entangled states are 
available for further use. 

We overcome this entangled-state phase sensitivity by interleaving 
periods of optical-phase stabilization with our entanglement genera- 
tion. During phase stabilization we input bright laser light at the same 
frequency as the NV-centre excitation light and detect the light reflected 
from the diamond substrate using the same detectors that are used to 
herald entanglement. The measured optical phase, estimated from the 
detected counts, is used to adjust the phase back to our desired value 
using a piezoelectric fibre stretcher. We achieve an average steady-state 
phase stability of 14.3(3)°, limited by the mechanical oscillations of the 
optical elements in our experimental set-up (the error quoted here and 
elsewhere is one standard deviation; Methods, Extended Data Fig. 6). 

To demonstrate the controlled generation of entangled states, we run 
the single-photon entangling protocol with a bright-state population 
of a=0.1. After entanglement is heralded, we apply basis rotations and 
single-shot state readout” at each node (A and B) to measure (o;'0;’) 
correlations between the nodes, where hereafter the standard Pauli 
matrices are referred to in the shorthand oy, oy, 0z=X, Y, Z. We 
observe strong correlations for (XX) and (YY) and, when sweeping 
the readout basis for node A, oscillations of these coherences, as 
expected from the desired entangled state (Fig. 2c, left). In combination 
with the measured (ZZ) correlations (Fig. 2c, right), this finding 
unambiguously demonstrates the establishment of entanglement 
between our nodes. 

We explore the trade-off between the entangled-state fidelity and the 
entanglement rate by measuring (XX), (YY) and (ZZ) correlations 
for a range of different initial bright-state populations a. Using these 
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Fig. 4 | Deterministic entanglement delivery. a, Each deterministic 
entanglement-delivery cycle combines: (1) optical phase stabilization; 

(2) NV-centre state checks, repeated until a threshold number of photons 
are detected at each node; (3) attempts at probabilistic entanglement 
generation (Fig. 2); and (4) upon heralded entanglement success, state 
protection by dynamical decoupling until the delivery time. 

b, Distribution of deterministic entanglement delivery outcomes for 
a=0.12 (left) and a = 0.2 (right) and different delivery rates. The fraction 
of cycles in which a herald photon is detected (‘heralded success’), in 
which no herald is detected (‘no heralded success’) and in which the NV- 


correlations, we calculate the fidelity of the heralded state relative to 
the desired maximally entangled Bell state for each value of a (Fig. 2d), 
along with the measured success rate (Fig. 2e). As predicted, the fidelity 
increases with decreasing a as the weight of the unentangled state 
I+) (7 7| diminishes (equation (5)). For small a, the fidelity saturates 
because the dark-count rates of the detectors become comparable to 
the detection rate. 

Choosing a to maximize fidelity, we find that our protocol allows us 
to generate entanglement with a fidelity of 0.81(2) at a rate of ren. =6 Hz 
(for a=0.05). Alternatively, by trading the entanglement fidelity for 
rate, we can generate entanglement at rent = 39 Hz with an associated 
fidelity of 0.60(2) (a=0.3). This represents an increase in the entan- 
gling rate of two orders of magnitude compared to previous NV-centre 
experiments’? and of three orders of magnitude compared to two- 
photon protocols under the same conditions'*. Compared to the 
maximum theoretical fidelity for a = 0.05 of 0.95, the states we generate 
have a 3% reduction in fidelity due to residual photon distinguishability, 
4% from double excitation, 3% from detector dark counts and 2% from 
optical-phase uncertainty (Methods). 


Delivery rate (Hz) 


centre state checks for at least one of the NV centres fail repeatedly for the 
whole cycle (‘offline’; often too small to be visible in the plot) are shown. 
The lines show the success rates predicted by our model. c, Average fidelity 
of deterministically delivered entangled states for a = 0.12 (left) and 
a=0.2 (right) and different delivery rates (diamonds). The average fidelity 
if classically correlated states were delivered for cycles in which no success 
event was heralded is also shown (circles). The associated lines (with 
shaded 1-s.d. statistical uncertainties) are the corresponding predictions of 
our model (Methods). Error bars in b and c represent 1 s.d. 


To reach a sufficient link efficiency mink to enable deterministic 
entanglement delivery, the single-photon protocol must be combined 
with robust protection of the remote-entangled states that are gen- 
erated. To achieve this, we carefully shielded our NV centres from 
external noise sources, including residual laser light and microwave 
amplifier noise, leaving as the dominant noise the slowly fluctuating 
magnetic field induced by the surrounding nuclear spin bath. 

We mitigate this quasi-static noise by implementing dynamical 
decoupling with ‘XY8’ pulse sequences (Fig. 3a, Methods, Extended 
Data Fig. 9). The fixed delay between microwave pulses in these 
sequences is optimized for each node”’. Varying the number of decou- 
pling pulses allows us to protect the spins for different durations. This 
dynamical decoupling extends the coherence time of node A and node 
B from about 5 1s to 290(20) ms and 680(70) ms, respectively (Fig. 3b). 
The difference in coherence times for the two nodes is attributed to 
differing nuclear-spin environments and microwave-pulse fidelities. 

To investigate the preservation of remote-entangled states, we 
incorporate dynamical decoupling for varying durations after suc- 
cessful single-photon entanglement generation (Fig. 3c). We find 
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an entangled-state coherence time of 200(10) ms (decoherence rate 
Ydec = 5.0(3) Hz). The observed entangled-state fidelities closely match 
the predictions of our model, which is based solely on independently 
determined parameters (Methods, Extended Data Table 1). In particu- 
lar, the decoherence of the remote-entangled state is fully explained by 
the combination of the individual decoherence rates of the individual 
nodes. 

The combination of dynamical decoupling and the single-photon 
entanglement protocol achieves a quantum link efficiency of ink 8; 
well above the critical threshold of mink 0.83 and comparable to the 
published® state-of-the-art in ion traps, Mink © 5. 

These innovations enable the design of a deterministic entangle- 
ment-delivery protocol that guarantees the delivery of entangled states 
at specified intervals, without any post-selection of results or pre-selec- 
tion based on the nodes being in the appropriate conditions (Fig. 4a). 
Phase stabilization occurs at the start of each cycle, after which there 
is a pre-set period before an entangled state must be delivered. This 
window must therefore include all NV-centre state checks (necessary to 
mitigate spectral diffusion via feedback control and to verify the charge- 
state and resonance conditions”), entanglement-generation attempts 
and dynamical decoupling necessary to deliver an entangled state. Fast 
conditional logic is used to adapt the experimental sequence dynami- 
cally on the basis of the detection of a heralding signal”'®**”°. Further 
details on the experimental implementation are given in Methods and 
Extended Data Fig. 1. 

We run our deterministic entanglement-delivery protocol at two val- 
ues of a (0.2 and 0.12) and for delivery rates of 7-12 Hz. We divide the 
experiment into runs of 1,500 cycles (that is, 1,500 deterministic-state 
deliveries), for a total dataset of 42,000 cycles. 

We first confirm that heralded entanglement occurs with the 
expected probabilities (Fig. 4b) by determining the fraction of cycles 
in which entanglement is heralded, in which no entangling attempts 
succeed and in which entanglement attempts do not occur at all because 
the NV-centre state check never succeeds. To establish reliable and 
useful quantum networks, it is important that entangled states can 
be delivered with high confidence over long periods. The nodes must 
therefore not be offline, for example, owing to uncompensated drifts 
in the resonant frequencies of the optical transitions. We therefore do 
not stop the experiment from running once it starts and include any 
such offline cycles in our datasets. Their negligible contribution (0.8% 
of cycles) confirms the robustness of our experimental platform and 
the effectiveness of our NV-centre frequency and charge-state control 
(discussed further in Methods). 

For each value of a and for each pre-set delivery interval, we deter- 
mine the average fidelity of the deterministically delivered states by 
measuring their (XX), (YY) and (ZZ) correlations (Fig. 4c). We find 
that for ~=0.2 and a rate of 9.9 Hz, we are able to create states witha 
fidelity of 0.56(1), demonstrating successful deterministic- 
entanglement delivery. 

Our model (solid lines in Fig. 4c) captures the trends of the deter- 
ministic entanglement-delivery data effectively. However, the observed 
state fidelities are slightly lower than the predicted ones, hinting at 
sources of decoherence that are not included in our model (Methods, 
Extended Data Fig. 4). Identifying these potential sources will be the 
subject of future work. 

During cycles in which entanglement is not successfully heralded, 
the spin states are nonetheless delivered and read out. In these cases, we 
deliver the state that the NV centres are left in after a failed entangle- 
ment attempt, which has a low fidelity with respect to the desired Bell 
state (for example, Funent = 0.04 for a=0.2). Although this stringent test 
highlights the robust nature of our protocol, we could instead deliver a 
mixed state (Funent = 1/4) or a classically correlated state (Funent = 1/2) 
when a successful event is not heralded. The resulting fidelities for 
our experimental data if classically correlated states were delivered are 
also plotted in Fig. 4b (grey circles). In this case, we would be able to 
deliver entangled states deterministically with fidelities of 0.62(1) at a 
rate of 9.9 Hz. 
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The deterministic entanglement delivery between remote NV centres 
demonstrated here is enabled by a quantum link efficiency exceeding 
unity. Straightforward modifications to our experiment are expected 
to increase the quantum link efficiency further. Refinements to the 
classical experimental control will allow us to reduce the duration of 
the entanglement attempt from 5.5 1s to less than 2 jus, which would 
more than double the entangling rate. Furthermore, the entangled-state 
coherence time could be improved substantially by exploiting long- 
lived nuclear-spin quantum memories'®*°?!, We anticipate that this 
will allow for link efficiencies in excess of 100 in the near future. Further 
improvements to the photon detection efficiency (including enhance- 
ment of the zero-phonon line emission)” would lead to an additional 
increase of at least an order of magnitude. 

In combination with recent progress on robust storage of quantum 
states during remote entangling operations'*“4, the techniques reported 
here reveal a direct path to the creation of many-body quantum states 
distributed over multiple quantum network nodes. Moreover, given 
the demonstrated potential for phase stabilization in optical fibre over 
distances of tens of kilometres”, our results open up the prospect of 
entanglement-based quantum networks at metropolitan scales. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
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METHODS 


Deterministically delivered entangled-state fidelity as a function of quantum 
link efficiency. We assume an entanglement-generation rate rent and an entangled- 
state decoherence rate rgec. If the rate at which entanglement attempts occur is 
much faster than rent (that is, there is a low probability of success), then we can 
approximate entanglement generation as a continuous process. In this case, the 
probability density for successfully generating entanglement at a time t after begin- 
ning our attempts is p(t) = Tone. The corresponding cumulative probability 
of success is p(t) =1—-e Tent! 

Once we succeed at creating entanglement, the state will decohere until the time 
at which we deliver it. For single-qubit depolarizing noise at each site, the fidelity 
of the resulting state after storage for a time t is 


0 
4 4 


If we deliver our entangled state at time fent = 3/Taec (where @ parameterizes the 
time in terms of the decoherence rate), then the average fidelity of the delivered 
state (given a success occurred) is 
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As discussed in the main text, we can choose to draw a black box around this 
process, delivering an unentangled state (state fidelity Funent < 1/2) for cycles in 
which no attempt at entanglement-generation succeeds so that a state is always 
delivered. This means that the states output from this black box will have a fidelity 
with respect to a Bell state Faget given by equation (1), where Feucc is as in the above 
equation. The maximum achievable fidelity when outputting a fully mixed state 
(Fanent = 1/4) upon failure Fy." (equation (2)) is found by optimizing Fyucc for a 
given quantum link efficiency mink. 

The full state of a quantum system can only be experimentally determined using 

an ensemble of identical states. This means that, in the absence of information 
about which deterministic entanglement-delivery cycles have a heralded success, 
the only accurate description of the output of such a black-box system is that a 
statistical mixture is deterministically output at each cycle. 
Experiment design. We use chemical-vapour-deposition homoepitaxially grown 
diamonds of type Ila with a natural abundance of carbon isotopes. Both diamonds 
were cut along the (111) crystal axis and were grown by Element Six. They are 
situated in custom-built confocal microscope set-ups within closed-cycle cryostats 
(4 K, Montana Instruments) separated by 2 m. We use fast microwave switches to 
shield both NV centres from microwave amplifier noise and therefore increase the 
coherence times substantially (node A uses Qorvo TGS2355-SM and node B uses 
Analogue Devices HMC544). All other parts of the set-up and sample details are 
described in the supplementary information of refs *!°. 

One cycle of the deterministic entanglement protocol consists of optical phase 
stabilization (described further below), charge-resonance checks to ensure that 
both NV centres are in the appropriate charge state and on-resonance”*, her- 
alded single-photon entanglement generation and finally dynamical decoupling 
to protect the state of the NV centres from their environment until the delivery 
time. The experimental sequences used in each step of this protocol (and the 
single-photon entanglement-generation experiment) are detailed in Extended 
Data Fig. 1. 

After delivery, the state of each NV centre is measured in a chosen basis. We use 
spin-selective optical readout of the NV-centre electron spin to determine its 
state in a single shot via the optical E, transition on both nodes**. We measure 
single-shot readout fidelities of 0.959(3) (0.950(3)) for the bright |m, = 0) = |T) 
ground state and 0.995(1) (0.996(1)) for the dark |m, = —1) = ||) state on node 
A (node B). These values are subsequently used to correct for readout errors of the 
electron spins in state-tomography measurements. 

Experiment control and communication logic. Extended Data Fig. 2 gives the 
decision trees and control logic for the ADwin microprocessors (Jaeger ADwin 
Pro II) that control the experiments. These microcontrollers are responsible for 


controlling all other experimental hardware and also communicate with each other 
to synchronize the experiment. 

Herald photon-detection window. We use a combination of polarization and 
temporal filtering to separate the excitation pulse from photons emitted by the NV 
centre. This necessitates a compromise between collecting as much of the emission 
light as possible, while ensuring that contamination from the pulse is minimized. 
In our experiment, we choose a temporal filter window (Extended Data Fig. 3) so 
that the pulse (assumed to have a Gaussian profile) is suppressed to the level of 
the detector dark counts by the beginning of the window. The end of the window 
about 30 ns after the pulse is chosen so that, for all of the datasets collected, the rate 
of detected NV-centre photons is greater than ten times the dark-count rate at all 
points within the window. We use a complex programmable logic device to apply 
this temporal filtering during our experiment and herald the successful generation 
of an entangled state in real time. 

Theoretical model of deterministic entanglement delivery. We develop a detailed 
model to determine the expected performance of the deterministic entangle- 
ment-delivery experiment, based on the independently measured parameters 
given in Extended Data Table 1. 

Once the set-ups are determined to be ready, the core entanglement sequence begins 
with single-photon entanglement generation. This proceeds by first initializing each 
node in |), followed by a coherent rotation using a microwave pulse to create the state 
given in equation (3). Resonant excitation of the NV-centre nodes excites only the 
bright |7) level to an excited state, which rapidly decays radiatively back to the ground 
state by emitting a single photon. This entangles the state of the NV centre with the 
presence (|1)) or absence (|0)) of a photon in the emitted optical mode (equation 
(4)). The photons emitted by each NV centre are transmitted to a central station at 
which a beam splitter is used to remove their ‘which path information. Successful 
detection of a photon at this station indicates that at least one of the NV centres is in 
the bright ||) state and thus heralds the creation of a spin-spin entangled state. This 
entangled state, expressed as [|NVaode a» NVnode p)» is (in un-normalized form) 
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This state is parameterized by 
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where V is the visibility of two-photon interference, pac is the dark-count proba- 
bility per detector (given by the product of the dark-count rate vga, and the 25-ns 
length of the detection window), and Ph, and Pe, are the probabilities of detecting 
a photon emitted by node A and node B, respectively. In the limit p,_| <1, for 
balanced detection probabilities p,. = ae = 2. , and assuming no other imper- 
fections, p tends to equation (5). 

The corresponding probability of successfully heralding entanglement is 
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The modelled success rate (plotted in Fig. 2e) is calculated by dividing Pneraia by 
the entangling-attempt duration (5.5 1s). 

We model double excitation (discussed further below) by applying a Pauli 
Z transformation to each of the NV-centre states with probability popy/2. Phase 
instability is modelled similarly, by applying a Pauli Z transformation to one of 
the states with probability 


2 2 
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where t, denotes the time since phase stabilization. 
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Finally, we model the effect of dynamical decoupling by assuming that it acts as 
a depolarizing channel for each qubit””. We therefore apply single-qubit depolar- 
izing errors with a probability determined by the measured dynamical-decoupling 
coherence times. For decoupling for a total time of tg, the total probability of a 
depolarizing error (that is, the application of a Pauli X, Y or Z transformation with 
an equal probability) is 3(1—e~ ta/T) /4. 

This model, based only on independently determined parameters (Extended 

Data Table 1), captures the trends of our deterministic entanglement generation 
data effectively (Fig. 4). However, we find that its predictions are slightly offset from 
the experimental measurements, suggesting that it does not include a small source 
of infidelity that is present in the experimental data. One potential origin of this 
discrepancy could be the increased number of attempts (up to two orders of mag- 
nitude) at generating entanglement after NV-centre state verification made here as 
compared to previous experiments”"”. Any additional sources of infidelity that may 
occur over this period (for example, owing to the passive charge-state stabilization 
process, discussed further below) are not included in the model. A detailed study 
of these potential imperfections is outside the scope of this work. Nonetheless, as 
an estimate of the order of this effect, we find that a small systematic correction 
of 3% to the heralded entangled-state fidelity is sufficient to effectively match our 
model to the data (Extended Data Fig. 4). 
Passive charge-state stabilization of individual NV centres. The negatively 
charged NV centre (NV_) can be ionized under optical illumination via a 
two-photon absorption process**. Owing to the different level structure of the 
neutral charge state NV°, the NV centre will remain dark if such an ionization event 
occurs during one of our entangling attempts. Ionization therefore hampers the 
performance of our deterministic entangling protocol by diminishing the success 
rate and delivery of a separable state upon success. Previous experiments" with 
NV centres that worked in the regime of probabilistically generated yet heralded 
remote entanglement overcame NV-centre ionization by frequent charge-state 
verification between protocols and by actively converting the NV centre back to 
NV°~ via interleaved resonant excitation of the optical transitions of NV°. 

Such active stabilization protocols would require additional logical overhead in 
our scenario, where entanglement is generated deterministically. Instead, we pas- 
sively stabilize the charge state during our entangling sequence by shining in an addi- 
tional weak laser beam that is resonant with the optical transition of NV° (Extended 
Data Fig. 5). This provides negligible disturbance to the spin-initialization 
fidelity of NV” while bringing the NV centre back into NV if it was converted to 
NV°. We additionally identify that the optical reset beam (duration, 1.5 1s) is the 
main cause of ionization in our system and carefully balance the power of both 
beams so that the spin state is still well initialized and that ionization is a negli- 
gible process for our deterministic entangling protocol (up to 15,000 entangling 
attempts). Reducing the applied power further by elongating the spin-reset dura- 
tion would decrease the entanglement rate and limit our quantum link efficiency. 

Extended Data Fig. 5 depicts the basic element that, in repetition, forms our 
sequence to probe the ionization rate. We use simultaneous charge- and spin-reset 
beams followed by a single microwave 7 rotation that brings the NV centre into 
||) and thus guarantees optical excitation during the next round. The NV centre 
is then read out after a final optical reinitialization into the bright state ||). By 
increasing the number of sequence repetitions, we observe a decay in the final 
readout fidelity that is associated with the ionization rate. By increasing the optical 
intensity of the charge-state reset beam, we obtain a negligible decay as a function 
of sequence repetitions, allowing us to overcome ionization in our deterministic 
entangling protocol. The illumination strength of the charge-reset beam is weak 
enough to avoid inducing noticeable spectral diffusion of the NV-centre emission; 
our measured entangled states are consistent with a high degree of indistinguish- 
ability for both NV-centre emission profiles (discussed further below). 
Optical-phase stabilization. The single-photon entanglement experiment requires 
that the optical phase of an effective interferometer between the two nodes is 
known (Fig. 2). The optical-phase difference between the paths of this interfero- 
meter must be known to ensure that entangled states are available for further use. 
This is achieved by interleaving periods of optical-phase stabilization with our 
entanglement generation. 

For phase stabilization we input bright laser light at the same frequency as, 
but orthogonally polarized to, the light used for excitation of the NV centres. The 
orthogonal polarization is chosen because we use a crossed polarizer to filter out 
the excitation light from the NV-centre emission. Using orthogonally polarized 
light for phase stabilization allows us to collect more light reflected from the dia- 
mond substrate. Before doing this, we verified that there is no measurable dif- 
ference in the relative phase of the two polarizations within our interferometer. 

Measurements of the phase drift (Extended Data Fig. 6a) show a slow drift on 
second timescales, but several strong resonances at hundreds of hertz (Extended 
Data Fig. 6b). These resonances are thought to be from mechanical elements in the 
path of the beam, including the microscope-objective mount. As we were unable to 
completely suppress these resonances in the current set-ups, we need to measure 
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the phase over a complete oscillation to estimate the mean phase reliably. The phase 
must therefore be measured for approximately 10 ms. 

We calculate an estimate of the phase from the counts detected at the herald- 
ing single-photon detectors. This estimate is used to adjust the phase back to our 
desired value using a -custom-built piezoelectric fibre stretcher and a proportion- 
al-integral-derivative routine within our ADwin microcontroller. We find that it 
takes two to three proportional-integral-derivative cycles to stabilize the phase 
optimally. We stabilize the phase for three cycles during the single-photon entan- 
glement experiment and for two cycles during the deterministic entanglement 
experiment. This difference is because phase stabilization occurs during every cycle 
of the deterministic entanglement-delivery experiment (about 100 ms), whereas it 
occurs only every 180 ms during the single-photon entanglement experiment and 
so the phase drifts slightly less after one experimental cycle. 

We achieve an average steady-state phase stability of 14.3(3)°, as measured by 
calibration routines spaced throughout the measurement of our dataset (Extended 
Data Fig. 6c, d). This stability is limited by the previously identified mechanical 
oscillations of the optical elements in our experimental set-up. The standard devia- 
tion of the phase averaged over a 10-ms period during active stabilization is 4.8(1)°. 

Optical phase stabilization is also likely to be feasible for long-distance network 

links. Using long-wavelength off-resonant light for phase measurements would 
enable continuous stabilization during entanglement attempts with a negligible 
effect on the NV-centre state. An experimental study” has shown that two network 
nodes separated by 36 km over a commercial fibre network would still allow for 
interference visibilities of 99%. For longer distances, it would also be possible to 
track the phase passively at the time of entanglement delivery and feed this infor- 
mation back to the nodes in which the state is stored, requiring only a coherence 
time longer than the communication time. 
Two-photon quantum interference. The quality of photon-mediated heralded 
entanglement between two emitters hinges on the indistinguishability of their 
emitted photons. We probe this indistinguishability by interfering emitted single 
photons on a beam splitter and measuring the number of events in which single- 
photon detectors connected to the output ports of the beam splitter both detect 
a photon. For completely indistinguishable single photons, Hong-Ou-Mandel 
interference ensures that both photons always exit from the same port of the beam 
splitter, so no coincident events should be detected. 

Our two-photon quantum interference experiment proceeds by exciting each 
emitter with a series of well-separated optical excitation pulses (separated by 1 
ps). We collect statistics on coincidence events in which one detector registers 
a photon after one excitation pulse and then the other registers a photon after a 
later excitation pulse. For an infinite pulse train, the number of coincidence events 
detected for each number of pulses between the detection events should be con- 
stant. However, for a finite pulse train, there are some pulses for a given pulse sep- 
aration for which there is no partner excitation pulse and therefore no coincident 
events will be detected. This leads to a linearly decreasing number of coincidence 
events as a function of pulse difference (Extended Data Fig. 7a). 

We use a linear fit to the coincidence events to infer the number of coinci- 
dences that would be detected from the same pulse (pulse difference of zero) if 
fully distinguishable single photons were input (Extended Data Fig. 7b). Because 
these are single photons, a counting argument shows that, for balanced emission 
probabilities from each emitter, the expected number of events is half of the value 
of the linear fit at zero pulse difference. 

The ratio r between the measured number of coincident events within the same 
pulse and the expected number of events for fully distinguishable photons is related 
to the single-photon wavefunction overlap V = |(7/1|2,)| > by V=(1 — nr) (again 
for balanced emission probabilities from each emitter). Incorporating the effect of 
the known imbalance in emission probabilities in our experiment, we find 
V=0.90(2). 

Dephasing of entangled states due to double excitation. An optical Rabi pulse 
is used to excite the NV-centre nodes to a higher level via a spin-conserving 
transition. The NV centre subsequently decays back to its original level through 
spontaneous emission, thereby entangling the spin state of the NV centre and the 
emitted optical mode. For optical Rabi pulses of finite duration, there is a chance 
that the NV centre will spontaneously emit a photon during the optical pulse and 
be re-excited before the end of the pulse. The first emitted photon will be lost to 
the environment, because it is impossible to distinguish it from the excitation 
light. However, if the subsequent emitted photon is detected in this double-excita- 
tion process, this will falsely herald entanglement. We measured the width of our 
optical pulse (Extended Data Fig. 8) and used a quantum-jump-based simulation 
to calculate the corresponding double-excitation probability. Given that the NV 
centre emitted a photon within the detection window, the probability that double 
excitation occurred is Poph = 0.04. 

State storage via dynamical decoupling. The coherence time of NV centres is lim- 
ited by interactions with other magnetic impurities. In our samples, the dominant 
source of magnetic field noise is the surrounding bath of slowly fluctuating °C 
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nuclear spins (natural abundance of 1.1%), which results in typical coherence times 
of 5 js. We use dynamical-decoupling “XY8’ sequences of the form (f-ny-2t-Ty-2t- 
Ty-2t-Ty-2t-Ty-2t-Ty—-2t-Ty-2t-Tx-1)™* to elongate the coherence times of both 
NV centres (Fig. 3), with microwave inversion pulses 1, the waiting time t and the 
number of pulses N. Each decoupling duration is obtained by arbitrary combina- 
tions of tand N. We find the optimal combination for a targeted protection duration 
of about 100 ms by varying t for a fixed N= 1,024. We choose N= 1,024 because 
the infidelity introduced from inversion-pulse errors is moderate for both nodes. 
Extended Data Fig. 9 shows the results of our decoupling-optimization proce- 
dure. We prepare the NV centre in a balanced superposition and choose waiting 
times that are integer multiples of the inverse '*C-nuclear-spin Larmor frequency 
v, to avoid coupling with the nuclear-spin bath (node A, 1, = 443.342 kHz; node 
B, v, = 442.442 kHz). Following previously reported techniques”, we further avoid 


coupling to other magnetic noise sources that result in loss of NV-centre coherence 
by picking five waiting times with a total variation of 16 ns for each multiple of 
the inverse Larmor frequency. The data (grey) are then sorted for the waiting time 
with the best state-preservation quality (blue) at each multiple, giving the minimal 
NV-centre coherence decay for this number of inversion pulses. We then pick the 
waiting time that guarantees a low number of inversion pulses while still providing 
high-quality state protection (red). 

Data availability. The data sets generated and analysed during this study are avail- 
able from the corresponding author on reasonable request. 


35. Aslam, N., Waldherr, G., Neumann, P., Jelezko, F. & Wrachtrup, J. Photo-induced 
ionization dynamics of the nitrogen vacancy defect in diamond investigated by 
single-shot charge state detection. New J. Phys. 15, 013064 (2013). 
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Extended Data Fig. 1 | Deterministic entanglement-delivery sequences. 


Pulse sequences for each step of the deterministic entanglement delivery 
protocol are shown. These sequences are also used in the single-photon 
entanglement-generation experiment. (1) Optical phase stabilization. 
Bright light is input to measure and stabilize the interferometer 

(see Methods). The duration is different for the single-photon 
entanglement experiment. (2) NV-centre state check. By shining in two 
lasers that are together resonant with transitions from all of the ground 
states, the NV centre will fluoresce regardless of its ground-state 
occupation. By counting photons emitted by the NV centre we can verify 
that both NV centres are in the desired charge state NV and that they are 
on resonance with the applied lasers. The NV centre is deemed to be on 
resonance if the number of photons detected during the charge/resonance 
check surpasses a certain threshold. If no photons are detected, then the 


(4) State preservation via dynamical decoupling 
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NV centre is assumed to be in the NV” state and a resonant laser is applied 
to reset it to NV. (3) Heralded single-photon entanglement generation. 
Entanglement generation proceeds by optically re-pumping the spins to 

|t) (including passive charge-state stabilization; see Methods) before a 
microwave (MW) pulse is used to create the desired bright-state 
population a at each node. A resonant excitation pulse then generates 
spin-photon entanglement. A subsequent microwave 7 pulse is used to 
ensure that the NV-centre state is refocused before the next stage should 
success be heralded. (4) Dynamical decoupling. Microwave pulses are used 
to implement dynamical decoupling (see Methods). (5) Single-shot 
readout. The NV-centre nodes can be read out in arbitrary bases in a single 
shot. If required, a microwave pulse is applied to rotate the qubit state 
before a resonant laser is applied. Fluorescence photons from the NV 
centre are detected if it is in the state |). 
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Extended Data Fig. 2 | Flowchart of the experimental sequences. 

The decision trees of the ADwin microprocessors (Jaeger ADwin Pro 

II) that create the overarching measurement and control loops for 
network nodes A and B are shown. Both nodes use arbitrary-waveform 
generators (AWGs) for microwave and laser pulse sequencing (Tektronix 
AWG5014C). We additionally use a complex programmable logic 

device (CPLD) to herald the successful generation of an entangled 

state in real time (described further in Methods). a, Decision tree when 
benchmarking the entangled state. b, Deterministic entanglement 
delivery. Here the ADwin microprocessors keep track of the time since 
the end of the phase stabilization (t= 0). ‘CR check’: as explained in 
Extended Data Fig. 1, the NV centre is deemed to be on resonance 

with the excitation lasers if the number of photons detected during the 
charge/resonance check surpasses a certain threshold (np» > thr); this 

is repeated until the threshold is passed. ‘comm? and ‘comm. timeout’: 
both ADwin microprocessors exchange classical communication, such 
as the success of the charge/resonance check, via a three-step-handshake; 
if one microprocessor waits longer than 1 ms for a response from its 
counterpart, then the communication times out and we return to the 
previous logical step (arrow). ‘Count attempts’: the number of entangling 
attempts N are counted until N= Nmax. ‘Count dec. time’: the time since 


comm. |timeout 


Complete cycle 1500 times 


phase stabilization is tracked; if the time is equal to the pre-specified 
state-generation time fen, then the AWG is triggered and the local readout 
sequences are executed. ‘Wait for basis rot?: ADwin microprocessors wait 
for a trigger input from the AWG (‘AWG done’), which indicates that 

the last microwave rotation before optical readout has been completed. 
“Trigger AWGs’: the microprocessor of node A triggers the AWGs of both 
nodes to initiate the microwave and entangling sequences; we use a single 
microprocessor as the trigger source to avoid timing jitter between both 
generated sequences. ‘SSRO’: optical single-shot readout. ‘Suc.: Trig. from 
CPLD and ‘ail: Trigger from AWG’: during entanglement generation, the 
CPLD communicates successful detection of a photon to the nodes; during 
the single-photon entanglement-benchmarking experiment, the AWG 

at each node flags failure of the round after 250 entangling attempts. “Do 
stabilize?’: The microprocessors communicate that phase stabilization will 
be the next step in the experimental sequence; the microprocessor at node 
A then proceeds with the phase stabilization while that at node B waits 
until the phase stabilization has finished. The deterministic entangling 
sequence is run a total of 1,500 times (500 times per readout basis) before a 
new round is called in, which starts again with the verification of resonant 
conditions for both NV centres. 
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Extended Data Fig. 3 | Temporal filtering of photons. Histograms are counted as valid entanglement events. The green line shows a Gaussian fit 
shown of the times at which photons are detected at each single-photon to the pulse with a full-width at half-maximum of 2.26 ns as measured in 
detector (blue) during a deterministic entanglement-delivery experiment Extended Data Fig. 8. This is used to estimate the contribution of residual 
with bright-state population a = 0.12. The orange histograms show the pulse photons within the filter window. 


photons that were detected within the temporal filter window and so were 
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Extended Data Fig. 4 | Comparison of experimental model and data. 
Including a 3% source of infidelity in our model (which otherwise consists 
only of independently determined parameters) is sufficient to account for 
the offset observed between our model and our experimental data. a, The 
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modified model, plotted with experimental data reproduced from Fig. 4. are 1 s.d. 
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Dashed lines show the model given in the main text (without the infidelity 
parameter). b, This infidelity also applies to the model shown in Fig. 3, 
because an equally large number of entanglement repetitions was used in 
generating the data. Error bars for data and shaded model uncertainties 
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NV~.a, Elementary sequence to probe the NV-centre ionization rate. the spin initialization of NV” is not affected by the additional blue- 
b, Applying our sequence many times results in decay of the NV-centre detuned beam and ionization is effectively mitigated over thousands of 
readout fidelity due to ionization (error bars represent 1 s.d.; lines show trials. 


fitted exponential decays as guides to the eye). By exploring the ionization 
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resonances of components in the set-up. c, Active phase stabilization is the measured post-entanglement optical phases (blue). A Gaussian fit with 
used to correct for phase drifts. Here the phase is stabilized and then the a standard deviation fixed to the average measured standard deviation for 
interferometer is allowed to passively drift for 2 s. The standard deviation all entanglement data taken, ojnt = 14.3(3)°, is also plotted (orange). 


of the phase as a function of time is plotted for a dataset of 100 of these 
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Extended Data Fig. 7 | Two-photon quantum interference. a, Histogram 
for coincident events measured by two single-photon detectors in a two- 
photon quantum interference experiment, measured by cross-referencing 
photon detection events from a pulse train of 10 optical x pulses that 
excite both emitters. Hong-Ou-Mandel interference of simultaneously 
coinciding photons ideally results in vanishing coincidence events within a 
single excitation round. The time difference between individual excitation 
rounds is 1 jus. Histograms of coincidence counts are shown with a bin-size 


of 5 ns. b, Total number of coincidences as a function of the number of 
pulses separating the two detection events. We extrapolate the measured 
coincidences to infer the expected coincidences for distinguishable 
photons at zero pulse difference by fitting a linear regression (orange). 
Using this to normalize the 22 observed coincidences for zero pulse 
difference allows us to estimate the two-photon quantum interference 
visibility V=0.90(2). Error bars are 1 s.d. 
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Extended Data Fig. 8 | Width of the optical 7 pulse. We find a full-width 
at half-maximum of 2.26 ns. This measurement is necessary to compute 
the dual-excitation probability (Extended Data Table 1), given a radiative 
lifetime of 12 ns. Error bars are 1 s.d. 
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Extended Data Fig. 9 | Determining the optimal inter-pulse delay for 
state storage and 1,024 inversion pulses. We initialize a superposition 
state on the NV-centre electron spin, preserve it via dynamical decoupling 
and finally perform optical readout after another 1/2 pulse. We probe the 
coherence of the NV centre by varying the inter-pulse delay t in steps of 
.25 ws and shifting the delay in steps of 4 ns for 


the Larmor period 1/1, = 2 
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Evolution time (ms) 


50 100 150 
Evolution time (ms) 


a total of five data points per Larmor period (grey). For each multiple of 
the Larmor period we pick the best (the most preserving) inter-pulse delay 
(blue). We determine the optimal delay t by selecting an inter-pulse delay 
that provides sufficient state preservation (about 100 ms) for a moderate 
number of pulses (red data point and inset text). Left, node A; right, 

node B. Error bars are 1 s.d. 
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Extended Data Table 1 | Independently measured experimental parameters for the performance of the nodes used in our experiment 


Node A Node B | Description 
T> (ms) 290(20) 680(70) | Dephasing time of the electron spin state. 
T (s) >1 >1 Relaxation time of electron spin eigenstates. 
Paet (10-4) | 2.8(1) 4.2(1) | Probability to detect a ZPL photon after a single excitation. 
Desritze <10-© <10°-® | Probability of passive charge-state control failure per entangling attempt 
(detailed in methods). 
t (us) 40.320 36.148 | Optimized inter-pulse delay for state storage. 
Fo 0.959(3) 0.950(3) | Fidelity of the electron read-out for |‘). 
Fs 0.995(1) 0.996(1) | Fidelity of the electron read-out for |). 
V 0.90(2) Visibility of the two-photon quantum interference (detailed in methods). 
P2ph 0.04 Estimated probability of double excitation during the optical a-pulse 
(detailed in methods). 
Vaark (Hz) 20 Dark count rate per detection channel. 
Clnt 14.3(1)° Initial uncertainty of the interferometric drift (detailed in methods). 
Vint (/8) ~ 20° Estimated drift rate of the free running interferometer (detailed in meth- 


ods). 
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Printing ferromagnetic domains for untethered 
fast-transforming soft materials 


Yoonho Kim!*°, Hyunwoo Yuk! 5, Ruike Zhao), Shawn A. Chester? & Xuanhe Zhao** 


Soft materials capable of transforming between three-dimensional 
(3D) shapes in response to stimuli such as light, heat, solvent, 
electric and magnetic fields have applications in diverse areas such 
as flexible electronics’, soft robotics** and biomedicine>’. In 
particular, magnetic fields offer a safe and effective manipulation 
method for biomedical applications, which typically require remote 
actuation in enclosed and confined spaces®*'°. With advances in 
magnetic field control!', magnetically responsive soft materials have 
also evolved from embedding discrete magnets!” or incorporating 
magnetic particles!’ into soft compounds to generating nonuniform 
magnetization profiles in polymeric sheets'*!°. Here we report 
3D printing of programmed ferromagnetic domains in soft 
materials that enable fast transformations between complex 3D 
shapes via magnetic actuation. Our approach is based on direct 
ink writing’® of an elastomer composite containing ferromagnetic 
microparticles. By applying a magnetic field to the dispensing 
nozzle while printing!’, we reorient particles along the applied field 
to impart patterned magnetic polarity to printed filaments. This 
method allows us to program ferromagnetic domains in complex 
3D-printed soft materials, enabling a set of previously inaccessible 
modes of transformation, such as remotely controlled auxetic 
behaviours of mechanical metamaterials with negative Poisson’s 
ratios. The actuation speed and power density of our printed soft 
materials with programmed ferromagnetic domains are orders of 
magnitude greater than existing 3D-printed active materials. We 
further demonstrate diverse functions derived from complex shape 
changes, including reconfigurable soft electronics, a mechanical 
metamaterial that can jump and a soft robot that crawls, rolls, 
catches fast-moving objects and transports a pharmaceutical dose. 

Our composite ink for 3D printing consists of magnetizable 
microparticles of neodymium-iron-boron (NdFeB) alloy (Extended 
Data Fig. 1a) and fumed silica nanoparticles (Extended Data Fig. 1b) 
embedded in a silicone rubber matrix containing silicone catalyst and 
crosslinker (Fig. 1a). The fumed silica within the silicone resin serves as 
a rheological modifier to induce the mechanical properties required for 
direct ink writing*!° including shear thinning (Extended Data Fig. 2a) 
and shear yielding (Extended Data Fig. 2b). These properties ensure 
that the composite ink can be extruded through a micro-nozzle when 
pressurized and that the deposited inks maintain their shapes even 
when stacked up to form multiple layers. The composite ink is prepared 
first by mixing the non-magnetized NdFeB particles and the silica nan- 
oparticles with the uncured elastomer matrix and then magnetized to 
saturation under an impulse field (about 2.7 T). The presence of yield 
stress in the composite ink helps to prevent the dispersed magnetized 
particles from agglomerating (Extended Data Fig. 3a). 

During the printing process, a magnetic field is applied along (or 
in reverse to) the flow direction of the ink via a permanent magnet or 
an electromagnetic coil placed around the dispensing nozzle (Fig. 1a). 
The applied field makes the magnetized NdFeB particles reorient along 
the field direction, imparting a permanent magnetic moment to the 
extruded ink filament. The magnetic polarities of the deposited inks 


can be tuned either by switching the applied field direction or chang- 
ing the printing direction. Using this approach, a 3D structure can 
be encoded with intricate patterns of ferromagnetic domains depend- 
ing on the magnetic polarities of the filaments that are arranged to 
construct the 3D structure. To avoid interference in the programmed 
domains of the printed structure by the applied field at the nozzle, a 
magnetic shield is used to attenuate the magnetic flux density under the 
nozzle tip (Fig. 1a). When the printing process is complete, the printed 
structure is cured at 120°C for 1 h, during which the presence of yield 
stress in the uncured ink helps the programmed ferromagnetic domains 
to remain unaffected by thermal randomization of the aligned particles. 

To evaluate the efficacy of our method in printing ferromagnetic 
domains, we measure the magnetic moment density, or magnetization, 
in samples (Extended Data Fig. 4) printed under various conditions 
including the magnetic particle content, the applied field strength and 
the nozzle diameter. First, samples are printed with magnetic inks con- 
taining different volume fractions of NdFeB particles through a nozzle 
with diameter 410 jm under a magnetic field of 50 mT at the nozzle tip. 
The measured magnetic moment density varies almost linearly from 
16 kA m7! to 81 kA m“! as the volume fraction of NdFeB particles 
in the composite ink increases from 5% to 20% (Fig. 1b). Next, as the 
applied field at the nozzle tip increases from 20 mT to 50 mT, the mag- 
netic moment density of printed samples (with 20 vol% NdFeB particles 
through nozzles with diameter 410 1m) increases from 68 kA m~! to 
81 kA m“! (Fig. 1c). When the nozzle diameter varies from 200 1m 
to 1,190 1m, the magnetic moment density of printed samples (with 
20 vol% NdFeB particles under a magnetic field of 50 mT at the noz- 
zle tip) increases from 76.6 kA m~! to 85.4 kA m7! (Fig. 1d). When 
printed with very fine nozzles (with diameters of 50 jum and 100 tm), 
the fibre diameter becomes larger than the nozzle diameter owing to 
the die-swelling effect (Extended Data Fig. 1c, d). The ratio between the 
fibre and nozzle diameters decreases as the nozzle diameter increases, 
reaching almost one when the nozzle diameter is larger than 200 1m 
(Extended Data Fig. le-g). Printing in the absence of external mag- 
netic fields yields magnetization values below 5 kA m~ for all nozzle 
diameters, because the particles are randomly oriented. Furthermore, 
we print samples in the absence of external field and then magnetize 
them under impulse fields (about 2.7 T) after curing, which yields the 
maximum achievable magnetic moment density at each volume frac- 
tion of NdFeB particles. In comparison, printing under magnetic fields 
of 50 mT yields a magnetic moment density that corresponds to about 
63%-64% of the maximum achievable value at the same concentration 
of NdFeB particles (Extended Data Fig. 5). 

We develop a model to predict the transformation of complex 
3D-printed structures with programmed ferromagnetic domains 
under magnetic fields. Application of magnetic fields induces torques 
on the embedded ferromagnetic particles, which create stresses that 
collectively lead to a macroscale material response. If the magnetic 
moment density (magnetization), a vector quantity, is M at a certain 
point of an incompressible body in the reference configuration, the 
magnetization vector at the same material point in the deformed body 
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Design 
Fig. 1 | Design of ferromagnetic domains in 3D-printed soft materials. 
a, Schematics of the printing process and the material composition. The 
ferromagnetic particles embedded in the composite ink are reoriented 
by the applied magnetic field generated by a permanent magnet or an 
electromagnet placed around the dispensing nozzle. b, Effect of the volume 
fraction of magnetized NdFeB particles in the ink on the magnetization of 
printed samples. c, Effect of the applied field strength around the nozzle 
on the magnetization of printed samples. d, Effect of nozzle diameter on 
the magnetization of printed samples. Samples printed in the absence of 
applied magnetic fields give magnetization values below 5 kA m7. Error 


can be expressed as FM, where F denotes the deformation gradient 
tensor at the point. Then, the magnetic potential energy per unit ref- 
erence volume under an applied magnetic field B can be expressed 
as Wmagnetic — _FM.-B, under the assumptions that the presence of 
soft materials does not substantially alter the applied field and that 
the potential energy from higher-order terms of B and M are negli- 
gible. From the magnetic potential energy density, the Cauchy stress 
tensor induced by the applied field on the magnetic moments can be 
calculated as o™8n¢tic — —B@FM, where the operation ® denotes the 
dyadic product, which takes two vectors to yield a second-order tensor. 
To simulate the deformation of complex structures programmed with 
ferromagnetic domains, the magnetic stress tensor is implemented as 
a user-defined element subroutine in the commercial finite-element 
analysis software ABAQUS (details are available in Supplementary 
Information). 


Applied magnetic field, B (mT) 
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bars indicate the standard deviation for n = 3 measurements at each data 
point. e-g, Schematic illustration (e), simulation of the finite-element 
model (f) and experimental results (g) of a single fibre encoded with 
alternating magnetic domains designed to form an ‘m shape in 0.1 s under 
an applied magnetic field of 200 mT. The elastomeric fibre is printed using 
a nozzle of diameter 840 j1m while switching the direction of magnetic 
fields (50 mT at the nozzle tip) generated by an electromagnetic coil that 
encompasses the nozzle. All samples discussed in c, d and g are prepared 
with the elastomeric composite ink containing 20 vol% of magnetized 
NdFeB particles. 


As an illustrative example to demonstrate the ability to program fer- 
romagnetic domains, a straight filament is printed with an alternating 
magnetization pattern as illustrated in Fig. le by switching the applied 
field direction during the printing. Upon application of a uniform mag- 
netic field of 200 mT, the straight filament transforms into an ‘n’ shape 
in 0.1 s (Fig. 1g), and quickly reverts to its original shape upon removal 
of the applied field in 0.2 s. Such rapid, reversible transformation can be 
repeated on demand by magnetic actuation (Supplementary Video 1). 
The simulation conducted under the same conditions, including the 
magnetic and mechanical properties and the applied field as in the exper- 
iment, is in good agreement with the experimental results (Fig. 1f), vali- 
dating the use of model-based simulation to guide the design of complex 
shape-morphing structures with programmed ferromagnetic domains. 

In Fig. 2, we present a set of two-dimensional planar structures that 
rapidly transform into complex 3D shapes under the applied magnetic 
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Fig. 2 | Various two-dimensional planar structures with programmed 
ferromagnetic domains demonstrating complex shape changes 

under applied magnetic fields. a-r, Schematic designs, finite-element 
simulations and experimental results for an annulus encoded with 
alternating domains that are equidistant (a-c); an annulus encoded with 
alternating domains that vary in size (d-f); a Miura-ori fold encoded with 
alternating oblique patterns of ferromagnetic domains (g-i); a hollow 
cross encoded with alternating ferromagnetic domains along the perimeter 
(j-]); quadrupedal (m-o) and hexapedal (p-r) structures enabled by 


fields of 200 mT as a result of the programmed ferromagnetic domains. 
In Fig. 2a and d, we design two annular rings with the same geome- 
try but different patterns of ferromagnetic domains to illustrate the 
effects of programmed domains on the macroscale response. Our mod- 
el-based simulation predicts that the two rings should yield different 3D 
morphologies under the same magnetic field applied perpendicularly 
to their planes. The second annulus encoded with alternating patterns 
that vary in magnitude gives a more complex undulating shape (Fig. 2e) 
than does the first annulus (Fig. 2b), whose alternating patterns are 
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folding of the magnetically active segments surrounding the magnetically 
inactive segments (unlabelled areas in the schematic designs). All of 

the demonstrated structures are printed with the elastomeric composite 
ink containing 20 vol% of magnetized NdFeB particles using a nozzle 

of diameter 410 pm under a magnetic field of 50 mT at the nozzle tip 
generated by a permanent magnet. Actuation of the demonstrated 
structures is performed by applying magnetic fields of 200 mT 
perpendicular to the planes of the structures. The detailed dimensions of 
the printed structures are given in Extended Data Fig. 6a-f. 


equidistant. The simulation results are in good agreement with exper- 
imental results (Fig. 2c, f, Supplementary Video 2), further demon- 
strating that our model is capable of guiding the design of complex 
shape-morphing structures using programmed ferromagnetic domains. 

When programmed with more intricate domain patterns, even a 
simple geometry can yield a complex 3D shape under an applied mag- 
netic field. As an example, in Fig. 2g, we design a simple rectangular 
structure with alternating oblique patterns of ferromagnetic domains 
to create a Miura-ori pattern'®, This untethered structure provides fast 
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Fig. 3 | Various 3D structures with programmed ferromagnetic 
domains demonstrating complex shape changes under applied 
magnetic fields. a—o, Schematic designs, finite-element simulations and 
experimental results for two adjoining hexagonal tubes programmed to 
form undulating surfaces under the applied magnetic field owing to the 
alternating ferromagnetic domains (a—c); a pyramid-shaped thin-walled 
structure exhibiting elongation in its diagonal direction along the applied 
magnetic field (d-f); a set of auxetic structures (with negative Poisson's 


(in 0.3 s) and fully reversible folding and unfolding under magnetic 
actuation (Fig. 2i and Supplementary Video 2), as predicted by our 
model (Fig. 2h). Notably, the response speed of our Miura-ori structure 
with programmed ferromagnetic domains is much faster than exist- 
ing ones in the literature based on liquid crystal elastomers!®”°, shape 
memory polymers”)? and thermally responsive hydrogels”. 

When more intricate designs are programmed with ferromagnetic 
domains, as illustrated in Fig. 2j, m and p, the two-dimensional planar 
structures transform into more complex 3D shapes (Fig. 21, 0, r and 
Supplementary Video 2); for these it is no longer straightforward to 
trace the original shapes without knowing the programmed domains. 
Design and realization of such complex shape-morphing structures 
are enabled by the model-based simulations, which accurately pre- 
dict the complex 3D morphologies (Fig. 2k, n, q and Supplementary 
Video 2). Previously, a transition from two-dimensional planar struc- 
tures to complex 3D shapes has been achieved by controlled buck- 
ling of materials that are selectively attached on biaxially stretched 
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ratios) exhibiting shrinkage in both length and width under applied 
magnetic fields (g—o). All of the demonstrated structures are printed 

with elastomeric composite ink containing 20 vol% of magnetized NdFeB 
particles using a nozzle of diameter 410 jum under a magnetic field of 

50 mT at the nozzle tip, generated by a permanent magnet. Actuation of 
the demonstrated structures is performed by applying magnetic fields 

of 200 mT along the directions indicated in a, d, g, j and m. The detailed 
dimensions of the printed structures are given in Extended Data Fig. 6g-k. 


elastomeric substrates***°. Compared with existing techniques, our 
method based on printing ferromagnetic domains offers additional 
advantages in two-dimensional to 3D structural transition, including 
(i) substrate-free, remote actuation, (ii) fast and fully reversible trans- 
formation and (iii) the capability to selectively actuate specific parts 
of the structure. 

Our method of printing ferromagnetic domains can be further 
extended to complex 3D structures. When printing 3D structures with 
direct ink writing, however, difficulties typically arise owing to struc- 
tural instability as the deposited filaments are stacked up. To ensure a 
more stable printing process, we introduce a support ink composed 
of a silicone resin containing catalyst and fumed silica nanoparticles 
(Extended Data Fig. 7; see Extended Data Fig. 2a and b for rheological 
properties). When printed, the support ink serves as a fugitive sup- 
port that buttresses the adjacent magnetic ink (Extended Data Fig. 7a 
and Supplementary Video 3). After the magnetic ink is fully cured, the 
support ink can be removed by solvent rinses (Extended Data Fig. 7c). 


14 JUNE 2018 | VOL 558 | NATURE | 277 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


Field off —-» Mode1 ——~> 


3) 


At=0.0s 0.2 0.3s 


At=0.0s 3.5s 455s 5.78 


Fig. 4 | Functional demonstrations of 3D-printed soft materials with 
programmed ferromagnetic domains. a, A reconfigurable soft electronic 
device (as detailed in Extended Data Fig. 9) based on the annular ring 
structure exhibiting different electronic functions depending on the 
direction of an applied magnetic field of 30 mT. b, A hexapedal structure 
stopping and holding a fast-moving object (glass ball of diameter 

18 mm and weight 8 g) upon application of a magnetic field generated 


The use of support ink and the consequent ability to print 3D struc- 
tures with programmed domains allow us to create a set of high-aspect- 
ratio multilayered structures (Fig. 3) that exhibit rapid and reversible 
transformation between complex 3D shapes under magnetic fields 
of 200 mT. In Fig. 3c, we present a thin-walled structure consisting 
of two adjoining hexagonal tubes with high aspect ratios. The ferro- 
magnetic domains are programmed in such a way that some parts of 
the tubes expand while the others collapse, as illustrated in Fig. 3a, 
to create complex undulating surfaces in a continuous 3D structure 
under the applied magnetic field, as predicted and observed by the 
simulation and the experiment (Fig. 3b, c and Supplementary Video 4), 
respectively. In another example, to demonstrate the versatility of our 
fabrication method, we create a pyramid-shaped thin-walled structure 
that elongates along the direction of applied magnetic fields (Fig. 3e, f 
and Supplementary Video 4) as a result of the programmed magnetic 
domains (Fig. 3d). 

The versatility of our model-guided design and fabrication method 
enables us to create auxetic structures (Fig. 3i, 1, 0), a type of mechan- 
ical metamaterials characterized by negative Poisson’s ratios. Our 
printed auxetic structures exhibit shrinkage in both length and width 
in response to external magnetic fields. Typically, mechanical meta- 
materials show auxetic behaviours only when uniaxially compressed 
or stretched and thus require direct mechanical contact*®. In addition, 
owing to the limited fabrication techniques available to achieve com- 
plex designs, remote actuation of untethered auxetic structures has 
not been realized in other types of active materials. Guided by our 
model-based predictions (Fig. 3h, k, n), we design a set of mechanical 
metamaterials with programmed ferromagnetic domains (Fig. 3g, j, m) 
that quickly shrink in both length and width under the applied fields 
within 0.5 s and recover their original shapes upon removal of the 
applied fields (Fig. 3i, 1, o and Supplementary Video 5). The use of 
magnetic fields as an actuation method obviates the need for direct 
contact in realizing auxetic behaviours in mechanical metamaterials. 

In the design and fabrication of shape-programmable soft materials, 
intensive efforts have been made to increase the level of complexity by 
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by a permanent magnet. c, A hexapedal structure wrapping an oblong 
pharmaceutical pill and carrying the pill using rolling-based locomotion 
under a rotating magnetic field generated by a permanent magnet. 

d, Horizontal leap of a 3D auxetic structure upon sudden reversal of the 
applied magnetic field direction while attenuating the field strength by 
rotating a permanent magnet by 90°. Detailed information on how to apply 
the magnetic fields to achieve actuation is given in Extended Data Fig. 10. 


acon ee 3D printing techniques such as inkjet printing”, stereolitho- 
and direct ink writing*®. However, fast and fully reversible 
actuation between programmed shapes has remained a central chal- 
lenge in the field. To quantitatively evaluate the actuation performance, 
we compare the energy density and the actuation rate (Extended Data 
Fig. 8a) of printed shape-programmable materials in the literature. We 
also compare the power density (Extended Data Fig. 8b), one often- 
used metric to evaluate the actuation performance of active materials. 
Our shape-morphing structures shown in Figs. 2 and 3 deform up to 
strain levels from 0.15 to 0.25 within 0.1 s to 0.5 s, providing a power 
density ranging from 22.3 kW m7? to 309.3 kW m~?, which are orders 
of magnitude greater than the actuation rates and power densities 
achieved by existing 3D-printed shape-transforming soft materials. 

The capability to create complex shape changes allows us to achieve 
diverse functions from our printed structures, as shown in Fig. 4. First, 
by combining electronic components and circuitry with our annular 
ring structure in Fig. 2c, we print a soft electronic device as detailed 
in Extended Data Fig. 9a. This soft electronic device deforms into two 
different shapes depending on the direction of applied magnetic fields 
of 30 mT, and each mode of transformation yields a different elec- 
tronic function (Fig. 4a, Extended Data Fig. 9b, c and Supplementary 
Video 6). The results demonstrate that our multimaterial 3D printing 
method gives functionally reconfigurable soft electronic devices, whose 
rigid-material counterparts have recently been achieved by means of 
multistable buckling*’. 

We further demonstrate the capability of interacting with an object 
based on the complex shape changes of the hexapedal structure 
shown in Fig. 2r. Using the fast response upon magnetic actuation, 
the hexapedal structure quickly stops a fast-moving object (Fig. 4b 
and Supplementary Video 7). When applying a magnetic field in the 
opposite direction to create a reversed shape, the hexapedal structure 
can catch a falling object and hold it against external disturbance and 
then release the object on demand by using the previous mode of 
transformation (Supplementary Video 7). When a rotating magnetic 
field is applied, the hexapedal structure can roll up its body and move 
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forwards and backwards by rolling-based locomotion (Supplementary 
Video 8). Harnessing the shape changes and motion, the hexapedal 
structure can carry an object with arbitrary shape such as a round or 
oblong pharmaceutical pill (Fig. 4c) and release the pill on demand 
(Supplementary Video 8). 

The 3D mechanical metamaterial presented in Fig. 3i can show a 
horizontal leap based on the drastic release of the elastic and mag- 
netic potential energy (Fig. 4d and Supplementary Video 9). The 
fast response of the auxetic structure generates an average speed of 
250 mm s~! during the leap, allowing it to move forwards by 120 mm 
within 0.7 s on the horizontal plane. This leaping motion is achieved by 
first applying a magnetic field in one direction to collapse the auxetic 
structure and then switching to a field in the opposite direction while 
attenuating the field strength. This sudden reversal of the field direction 
quickly increases the magnetic potential energy and triggers the drastic 
release of the stored elastic and magnetic potential energy, which is 
converted to kinetic energy during the horizontal leap. 

Our printing method as a fabrication platform can be extended to 
multiple composite inks using different types of elastomer and hydrogel 
matrices and magnetic particles. By printing ferromagnetic domains in 
soft materials, we introduce new design parameters—domain patterns, 
magnetization strength and actuation fields—into the design and fabri- 
cation of shape-programmable soft materials. The remote actuation of 
such untethered, complex and fast shape-shifting soft materials based 
on magnetic fields suggests new possibilities for applications in flexible 
electronics, biomedical devices and soft robotics. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
018-0185-0. 
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METHODS 


Ink composition and preparation. The magnetic ink was prepared first by blend- 
ing two silicone-based materials—SE 1700 (Dow Corning Corp.) and Ecoflex 
00-30 Part B (Smooth-on Inc.)—in a 1:2 volume ratio. Ecoflex 00-30 Part B, a 
softer elastomer than SE 1700, was used to achieve the preferred mechanical 
properties of the composite material. Fumed silica nanoparticles (amorphous, 
20-30 nm; US Research Nanomaterials Inc.), which corresponds to 12.5 wt% 
with respect to Ecoflex Part B, were added to achieve required rheological prop- 
erties for direct ink writing. After mixing the blend in a planetary mixer (AR-100, 
Thinky) at 2,000 r.p.m. for 2 min, 20 vol% NdFeB microparticles (287.5 wt% with 
respect to Ecoflex Part B) with an average size of 5 jzm (MQFP-B-2007609-089, 
Magnequench) were added into the elastomer mixture and then mixed thoroughly 
at 2,000 r.p.m. for 3 min, followed by defoaming at 2,200 r.p.m. for 1 min. The com- 
posite ink was then magnetized by impulse magnetic fields (about 2.7 T) generated 
by an impulse magnetizer (IM-10-30, ASC Scientific) to impart magnetic polarities 
to the ferromagnetic particles embedded in the elastomer matrix. Both SE 1700 
and Ecoflex 00-30 are platinum-catalysed, addition-curing silicones, so 10 wt% 
SE 1700 catalyst with respect to SE 1700 base was added into the magnetized ink 
and then mixed at 2,000 r.p.m. for 30 s before printing. The final concentrations of 
components were as follows: 21.78 wt% Ecoflex 00-30 Part B, 2.72 wt% fumed silica 
nanoparticles, 11.71 wt% SE 1700 base, 1.17 wt% SE 1700 catalyst and 62.62 wt% 
NdFeB microparticles. For imaging purposes, about 2 wt% fluorescent colourants 
(Ignite PMS 805C, Smooth-on Inc.) were added to this final composition. 

The support ink, which was used for supporting structures when printing 
multilayered or 3D structures with the magnetic ink, was prepared by mixing a 
platinum-based silicone-curing accelerator (Elastosil CAT PT-F, Wacker) with 
fumed silica nanoparticles (amorphous, 20-30 nm; US Research Nanomaterials 
Inc.) in a 5.45:1 mass ratio. Fumed silica nanoparticles were added to achieve the 
rheological properties required for direct ink writing of the support ink. The higher 
concentration of catalyst in the support ink prevents diffusion of catalyst molecules 
from the adjacent magnetic inks, and therefore helps prevent imperfect curing of 
the printed magnetic structures. After the magnetic inks were fully cured upon 
heating at 120°C for 1 h, the fugitive support ink was removed by rinsing with 
isopropyl alcohol using an orbital shaker (Micro Plate Shaker, VWR). 

Printing procedure. The prepared magnetic and support inks were loaded into 
syringe barrels and defoamed at 2,200 r.p.m. for 1 min to remove trapped air bub- 
bles. The inks were then mounted to the custom-designed 3D printer based on a 
Cartesian gantry system (AGS1000, Aerotech). Conical nozzles with inner diam- 
eter 410 sm (Smoothflow Tapered Tip, Nordson EFD) were used for both inks in 
our demonstrations (Figs. 2 and 3). The detailed designs including ferromagnetic 
domain patterns and dimensions of the printed structures in Figs. 2 and 3 are given 
in Extended Data Fig. 6. The external magnetic fields applied at the nozzle to reor- 
ient the magnetic particles embedded in the ink during printing were generated by 
either an electromagnet or a permanent magnet. Printing paths were generated by 
CAD drawings (SolidWorks, Dassault Systemes) and converted into G-code by a 
commercial software package (CADFusion, Aerotech) and custom Python scripts 
to command the x-y-z motion of the printer head. See Supplementary Video 3 for 
overall printing and actuation processes. 

Rheological characterization. Rheological responses (Extended Data Fig. 2a, b) 
of the magnetic and support inks were characterized using a rotational rheometer 
(AR-G2, TA Instruments) with a 20-mm-diameter steel plate geometry. For mag- 
netic inks, both magnetized and nonmagnetized samples were tested to evaluate 
the effects of magnetic interaction between the embedded magnetized particles. 
Apparent viscosities were measured via steady-state flow experiments with a sweep 
of shear rates (0.01-100 s~1). Shear storage moduli were measured as a function of 
shear stress via oscillation experiments at a fixed frequency of 1 Hz with a sweep 
of stress (10-10,000 Pa). The magnetic and support inks were equilibrated at 25°C 
for 1 min before testing, and all experiments were performed at 25°C with a gap 
height of 0.5 mm. 

Magnetic characterization. The quality of alignment of the magnetic particles 
was evaluated by measuring the magnetic moment density (magnetization) of 
the printed samples with a vibrating sample magnetometer (DMS 1660, ADE 
Technologies). To prepare specimens, a set of parallel lines was printed in the 
same direction to construct a rectangular film. Then, the printed film was cut 
into circles using a 6-mm biopsy punch (Miltex Inc.) to fit into the sample holder 
of the machine. The magnetic moments of the samples were measured against a 
sweep of external magnetic fields from —8,000 A m! to 8,000 A m~!, Remanent 
magnetization, which corresponds to the measured magnetic moment when the 
applied external field is zero, was divided by each specimen’s volume to obtain the 
magnetic moment density of the specimen. 

Mechanical testing. Two types of rectangular planar sheets (width 12 mm, length 
35 mm) were printed with an conical nozzle of diameter 840 \1m in the absence 
of external magnetic fields and under applied magnetic fields generated by a per- 
manent magnet around the nozzle, respectively. After curing, the sheets were cut 


into dog-bone-shaped specimens with known dimensions (width 4 mm, gauge 
length 17 mm) for tensile testing. The cross-sectional area of each specimen was 
calculated by dividing the sample's original volume by its length, where the volume 
was calculated based on the sample’s mass measured before the cut and the density 
(2.434 g cm~*) of the composite ink containing 20 vol% of NdFeB. The specimens 
were tested on a mechanical testing machine (Z2.5, Zwick/Roell) with a 20 N load 
cell at a strain rate of 0.01 s~!. Nominal stress-stretch curves were plotted for 
both materials, and shear moduli ju were obtained by fitting the experimental 
curves using a neo-Hookean model (Extended Data Fig. 2c). The shear modulus 
of the magnetized-ink-based material we obtained was ju = 330 kPa. The specimen 
printed in the presence of external fields showed higher shear modulus compared 
with the specimen printed without external fields (44 = 245 kPa). This higher shear 
modulus may be attributed to the field-induced alignment of ferromagnetic par- 
ticles along the filaments when magnetic fields were applied during the printing. 
Imaging and videography. Images and videos of the programmed shape changes 
of the printed samples were taken under a blue LED light source (460-nm wave- 
length), while applying external magnetic fields in ambient conditions. The 
external fields were generated by either permanent magnets (K&J Magnetics) or 
electromagnets (APW Company) depending on the shapes and sizes of the printed 
samples (Extended Data Fig. 10). Images and videos were taken with a DSLR cam- 
era (D7000, Nikon) with a red-coloured filter (HMC R25A, Hoya) that transmits 
wavelengths above about 600 nm (Supplementary Video 3). To reduce the effects 
of friction between the printed samples and the substrate, fine glass powders with 
an average size of 9-13 jm (glass spheres, Sigma Aldrich) were applied to the 
substrate as a dry lubricant. 

Finite-element analysis. For all designs presented in Figs. 1-3, the shapes 
deformed in the presence of external magnetic fields were simulated using a 
user-defined element subroutine implemented in the commercial finite-element 
analysis software ABAQUS. For all simulations, the following input parameters 
were used: the shear modulus pz = 330 kPa, the bulk modulus K = 1,000y (the 
large bulk modulus was chosen to approximate incompressibility), and the uni- 
form external magnetic field B = 200 mT. For the magnetization parameter in the 
simulation, the experimentally measured value (M = 81 kA m7}; see Fig. 1b) was 
used for samples printed with inks containing 20 vol% of NdFeB particles through 
nozzles with 410 jum diameter under the applied field of 50 mT at the nozzle tip. 
Validation of printing-induced magnetization. The magnetization of a sample 
printed with a nozzle of diameter 410 1m in the presence of magnetic fields (50 mT) 
was measured while varying the angular position of the printed fibres with respect 
to the horizontal direction (Extended Data Fig. 4a). The maximum magnetiza- 
tion value was measured when the printed fibres are aligned with the positive x 
direction, in which an external magnetic field is applied by the vibrating sample 
magnetometer. The measured magnetization value decreased as the angle increased 
and reached almost zero when the printed fibres were vertically aligned. When 
the specimen was rotated by 180°, the sign of the measured magnetization was 
changed, indicating that the specimen’s magnetic polarity was reversed, while the 
magnitude remained almost unchanged (Extended Data Fig. 4b). It demonstrates 
that the printed fibre direction can represent the overall magnetization direction 
(that is, magnetic polarity) of the printed sample. 

Calculation of energy and power density. As evaluation criteria for actuation 
performance, we used three quantities that can either be found or calculated from 
the literature. First, we used the actuation rate f=1/t, where f is the time taken to 
generate one complete actuation cycle. For materials with irreversible actuation 
such as shape memory polymers””””, the actuation rate was considered to be the 
time taken for completing the first cycle of actuation. Second, we used the energy 
density pp, which can be determined from force-stroke curves for actuation. Third, 
we used the power density pw = fpr to evaluate the overall actuation performance 
of various 3D-printed shape-programmable soft materials. For papers that report 
force-stroke curves for actuation or the corresponding values of energy density, 
we used the reported values. When the data were not available, we approximated 
the values for different classes of materials. 

For elastically actuated materials such as shape memory polymers, liquid crystal 
elastomers and other composite materials, the energy density was approximated 
as*? pg = E(€,)"/2 where E is Young’s modulus in the rubbery state and ¢, is the 
actuation strain. For osmotically actuated materials such as hydrogels and other 
swelling-based actuators, the energy density was approximated as** pp = AJTe,/2, 
where AJJ is the change in osmotic pressure upon swelling and ¢, is the swell- 
ing strain. For magnetically actuated materials with programmed ferromagnetic 
domains, the energy density was approximated as pg = 3j1(€,)°/2, where ju is the 
shear modulus and ¢, is the strain that developed in a deformed state due to the 
applied magnetic field. The quantities to evaluate the actuation performance were 
summarized in Extended Data Fig. 8. 

Data availability. All data generated or analysed during this study are included in 
the published article and its Supplementary Information, and are available from 
the corresponding author on reasonable request. 
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Code availability. The codes for 3D printing and the user element subroutine for 
numerical simulation are available upon request. 
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Extended Data Fig. 2 | Mechanical characterizations of the ink and the 
printed materials. a, b, Apparent viscosity as a function of applied shear 
rate (a) and storage modulus as a function of applied shear stress (b) for 
20 vol% magnetized ink (red), 20 vol% nonmagnetized ink (black) and 
support ink (grey). c, Nominal tensile stress—stretch curves (solid lines) 
for specimens printed with the magnetic ink in the absence of external 
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1.15 1.20 


Stretch 


fields (black), with applied magnetic fields of 50 mT (red) at the nozzle 
tip generated by a permanent magnet, and the elastomer matrix with 
no magnetic particles (blue). The shear modulus j of each material was 
obtained by fitting the experimental curves to a neo-Hookean model 
(dashed lines). 
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Extended Data Fig. 3 | Micro-computed tomography images of printed 410 jum in the presence of an applied magnetic field of 50 mT at the nozzle 
fibres. a, A fibre printed with a nozzle of diameter 410 jum in the absence tip. No obvious aggregation of ferromagnetic particles in the printed fibres 
of applied magnetic field. b, A fibre printed with a nozzle of diameter can be observed. 
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induced during printing under the applied magnetic field. b, Magnetization values measured at various angular positions of the 
a, Experimental setup with a vibrating sample magnetometer for printed fibres with respect to the external magnetic field applied by the 


measuring the magnetization of a sample printed with a nozzle of diameter _ vibrating sample magnetometer. 
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at each volume fraction, a permanent magnet is used to generate external under the applied field of 50 mT yields a magnetic moment density that 


fields (50 mT at the nozzle tip). The magnetization values of printed 


corresponds to 63%-64% of the maximum achievable magnetization at 


samples (red) are compared with the maximum achievable magnetization each volume fraction of NdFeB particles. 


values (black) measured from uniformly magnetized samples. The 
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Extended Data Fig. 6 | Schematic designs and dimensions of the two- 
dimensional and 3D structures in Figs. 2 and 3. a, An annulus encoded 
with alternating domains that are equidistant. b, An annulus encoded with 
alternating domains that vary in size. c, A Miura-ori fold encoded with 
alternating oblique patterns of ferromagnetic domains. d, A hollow cross 
encoded with alternating ferromagnetic domains along the perimeter. 

e, f, Quadrupedal (e) and hexapedal (f) structures enabled by folding of 
the magnetically active segments surrounding the magnetically inactive 


ml 


height: 6 


32.5 


segments. g, Two adjoining hexagonal tubes programmed to form 
undulating surfaces under applied magnetic fields. h, A pyramid-shaped 
thin-walled structure programmed to elongate in its diagonal direction 
along applied magnetic fields. i-k, A set of auxetic structures (with 
negative Poisson's ratios) programmed to shrink in both length and width 
under applied magnetic fields. The dimensions in this figure are given in 
millimetres. 
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Extended Data Fig. 7 | Overall fabrication process of printing support ink prevents diffusion of catalyst molecules through the interface 
multilayered structures assisted by the use of support inks. a, Printing and thus prevents imperfect curing of the adjacent magnetic inks. Ac 
multilayered hexagonal arrays using magnetic and support inks. The use of __ denotes the difference in catalyst concentration between the support and 
support inks as fugitive buttresses enables stacking the deposited magnetic magnetic inks. c, The printed magnetic inks are cured by heating at 120°C 
inks stably up to tens or even hundreds of layers. b, Chemical composition _ for 1 h. The support ink is then removed by solvent rinses. 

of magnetic and support inks. The higher concentration of catalyst in the 
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Extended Data Fig. 8 | Actuation performance of 3D-printed shape- 
programmable soft materials. a, Energy density and actuation rate 

of our magnetically responsive structures presented in Figs. 1-3 

are plotted and compared with those of existing 3D-printed shape- 
programmable soft materials in the literature?” *°*?>". b, Power density 
is calculated as energy density multiplied by the actuation rate of each 
material and plotted for comparison; the materials are listed in order of 


increasing power densities. PNIPAAm = poly(N-isopropylacrylamide); 
PMBA = poly(benzyl methacrylate); PHEMA = poly(hydroxyl 

ethyl methacrylate); PLA = poly(lactic acid); RM82 = 1,4-bis-[4-(6- 
acryloyloxhexyloxy)benzoyloxy]-2-methylbenzene. TangoBlack and 
VeroClear are commercially available acrylic photocurable polymers from 
Stratasys Ltd. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


a LEDs 


Conductive silver ink 


Magnetic re 


PDMS (insulator) 


Cathode Electrode substrate 


(gold-sputtered) 


Bottom view 


At rest (0 mT) Mode 1 (30 mT) Mode 2 (30 mT) 


= + 
+ 
1.9~2.2V 
At rest Mode 1 Mode 2 
Extended Data Fig. 9 | Schematic designs and working principles which yield different electronic functions (red micro-LEDs lit up in Mode 1 
of the reconfigurable soft electronic device demonstrated in Fig. 4a. and green micro-LEDs lit up in Mode 2). c, Schematic diagram of the 
a, Exploded and bottom views of the printed device, in which soft embedded soft electronic circuits, which are designed to turn active only 
electronic circuitry and components are embedded by means of a hybrid in the designated mode of transformation owing to the selective contact 
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shapes depending on the direction of applied magnetic fields of 30 mT, 
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Extended Data Fig. 10 | Schematic illustrations of the methods for 
applying magnetic fields to actuate the printed structures. a, b, The 
magnetic fields for actuating the printed structures can be applied in 
two ways. A pair of electromagnetic coils are used to generate a uniform 
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magnetic field (a). A NdFeB magnet (width 2 in, length 3 in, thickness 
0.5 in, surface flux density 300 mT) is used to create spatially varying 
magnetic fields for dynamic actuation by combining vertical, horizontal 
and rotational movements of the magnet (b). 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


1 oa Ie en 


https://doi.org/10.1038/s41586-018-0188-x 


Velocity-resolved kinetics of site-specific carbon 
monoxide oxidation on platinum surfaces 
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Charles T. Campbell’, Dirk Schwarzer, Dan J. Harding!*’, Alec M. Wodtke!4 & Theofanis N. Kitsopoulos?*°* 


Catalysts are widely used to increase reaction rates. They function 
by stabilizing the transition state of the reaction at their active site, 
where the atomic arrangement ensures favourable interactions!. 
However, mechanistic understanding is often limited when 
catalysts possess multiple active sites—such as sites associated 
with either the step edges or the close-packed terraces of inorganic 
nanoparticles*-4— with distinct activities that cannot be measured 
simultaneously. An example is the oxidation of carbon monoxide 
over platinum surfaces, one of the oldest and best studied 
heterogeneous reactions. In 1824, this reaction was recognized to 
be crucial for the function of the Davy safety lamp, and today it is 
used to optimize combustion, hydrogen production and fuel-cell 
operation®*. The carbon dioxide products are formed in a bimodal 
kinetic energy distribution’ 1%; however, despite extensive study’, 
it remains unclear whether this reflects the involvement of more 
than one reaction mechanism occurring at multiple active sites”, 
Here we show that the reaction rates at different active sites can be 
measured simultaneously, using molecular beams to controllably 
introduce reactants and slice ion imaging!“ to map the velocity 
vectors of the product molecules, which reflect the symmetry and 
the orientation of the active site!®. We use this velocity-resolved 
kinetics approach to map the oxidation rates of carbon monoxide 
at step edges and terrace sites on platinum surfaces, and find that 
the reaction proceeds through two distinct channels!!"™: it is 
dominated at low temperatures by the more active step sites, and 
at high temperatures by the more abundant terrace sites. We expect 
our approach to be applicable to a wide range of heterogeneous 
reactions and to provide improved mechanistic understanding of 
the contribution of different active sites, which should be useful in 
the design of improved catalysts. 

The oxidation of CO on the (111) surface of platinum—the simplest 
close-packed single-crystal surface—proceeds through the Langmuir- 
Hinshelwood mechanism”””. O, is activated by dissociating into 
adsorbed oxygen atoms, which combine with adsorbed CO to form 
CO, with substantial excess energy. This can lead to hyperthermal 
CO, which has high kinetic and internal energy*” and exhibits angu- 
lar distributions that are narrower than those expected for a molecule 
at thermal equilibrium with the surface’’”!°; however, some reaction 
conditions also result in a thermal channel!!“'3, Although this bimodal 
generation of products suggests that CO oxidation occurs through 
more than one reaction!”!3, the dominant view at present is that all 
products form through the same transition state, and a fraction des- 
orbs promptly with hyperthermal velocities whereas the rest become 
trapped and thermalized at the surface before desorbing". 

Our study of this reaction exploits the correlation between the angle 
and velocity of reaction products and the symmetry and orientation of 
surface reaction sites!®. This enabled us to extract site-specific kinetic 
data by data-mapping the temporal evolution of velocity-resolved 
products. We used pulsed-molecular-beam surface dosing of both CO 


and O; in combination with slice ion imaging'*!°. The O, beam was 
used to control oxygen coverage and the pulsed CO beam was used to 
initiate the reaction; together with synchronized pulsed-laser ionization 
of CO, this enabled us to obtain the CO, flux as a function of reaction 
time. At each time point in the reaction, ion imaging provided distri- 
butions of the velocity vectors of the products. 

For CO oxidation on Pt(111) with a step density of 0.25%, ion images 
of the CO) produced (Fig. 1a) show a bimodal speed distribution!» 
(Fig. 1b). The low-velocity component was fit to a Maxwell-Boltzmann 
distribution, whereas the high-velocity component is hyperthermal. 
The angular distributions of the two channels (Fig. 1c) show that the 
hyperthermal component has a sharp peak along the surface normal 
(approximately cos’@), whereas the thermal channel follows a cos0 
distribution. 

To obtain site-specific kinetics, we recorded the integral of the ion 
image intensity over selected velocity ranges (red and blue rectangles in 
the inset of Fig. 2) as a function of the delay between the CO and laser 
pulses. The raw time-dependent signals are proportional to product 
density and depend not only on the reaction time at the surface, but also 
on the flight time of the CO, from the surface to the ionizing laser spot. 
By recording the velocity of the products (see Methods), we corrected 
for the CO; flight-time and obtained the relative flux in both chan- 
nels. From the measured angular distributions, we corrected the signal 
strength to reflect the true branching ratio between the two channels. 
Thus, we rigorously obtained the relative product flux as a function of 
reaction time, hereafter referred to as the kinetic trace. 

Kinetic traces are shown in Fig. 2 for both the hyperthermal (red) 
and thermal (blue) channels. In both, the CO, formation rate first 
increases as CO adsorbs onto the surface (black dashed line), and then 
decays with an effective lifetime, Teg (see Methods), as the adsorbed 
CO is removed. Teg for the thermal and hyperthermal channels— 
hereafter referred to as a and tae ™, respectively—are quite different; 
Tat" is shorter than 7"? under all conditions in this study. This is 
inconsistent with a bimodal velocity distribution resulting from partial 
thermalization of products from a single reaction”!!, as that would 
require the two kinetic traces to be identical. It should be noted that the 
desorption of physisorbed CO, is fast compared to the reaction time. 

We assign the hyperthermal and thermal channels to the reaction of 
CO with oxygen adatoms at terraces and steps, respectively, for three 
reasons. First, when we increased the step density 66-fold using a 
Pt(332) crystal, the hyperthermal channel disappeared completely at 
low oxygen coverage, at which nearly all oxygen atoms are bound at 
steps (lower left inset of Fig. 2). This confirms that the hyperthermal 
channel does not occur at steps. Second, in the limit of zero oxygen 
coverage, Tef is determined by any non-reactive channels that consume 
adsorbed CO (see Methods). For the thermal channel, the zero- 
coverage rate constants (filled blue circles in Extended Data Fig. 4) are 
in excellent agreement with known values of CO desorption from 
steps!8, confirming that CO molecules adsorbed at steps (hereafter 
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Fig. 1 | Slice ion imaging of CO2 produced by CO oxidation at Pt(111). 
a, Ion image of CO produced by the catalytic oxidation of CO ona 
Pt(111) surface. The velocity vector of the incident CO beam is shown by 
the light green arrow, as are a few of the CO2 product vectors appearing 
in the thermal and hyperthermal channels. CO; speed increases to the 
right. The position of the ionizing laser beam is shown by the dashed blue 
line. b, Velocity probability distribution of COz molecules. The thermal 


denoted cos?) are important to the thermal reaction. Third, we 
calculated the minimum energy path of the terrace reaction 
CO;*" + OF" using density functional theory (DFT; see Methods). 
This shows an energy release of 1.9 eV and an early transition-state 
structure (that is, it resembles the reactants), which is consistent with 
the average translational energy release of 0.38 eV that is seen in the 
hyperthermal channel. This assignment is also consistent with the pre- 
viously observed dominance of the hyperthermal channel at 880 K® 
(see Methods) and the prevalence of the thermal channel that is seen 
with increased step density'!. 

Considering the above, we interpret the kinetic traces of Fig. 2 in 
terms of competing reactions that occur with oxygen bound at terrace 
and step sites. We consider the terrace reaction competing with a process 
involving diffusion that converts CO!*" to COS”. This explains how 
adsorption at terraces (more than 99% of the CO adsorbs initially at 
terraces, as the Pt(111) step density is less than 1%) can lead to more 
CO, production at steps. The kinetic trace of the thermal channel shown 
in Fig. 2 cannot be fit with a single exponential function (see Extended 
Data Figs. 6, 8), which suggests the presence of two step reactions. 

In Methods, we present a simple kinetic model that describes our 
data over the full range of oxygen coverages and surface temperatures 
using three reactions: 


KIT 
Terr Terr 2 hyperthermal 
colo" +co8 


KIS 
Terr Step “2 thermal 
Co! +08? = +60! 


SS 
Step Step “2 thermal 
cos + 08” ,co! 


We also include COS? desorption, as well as CO," desorption and 
conversion of CO;*" to COS". The determination of oxygen coverage 
and the Os?:O/*" ratio is described in Methods. We note that 
velocity-resolved kinetics of CO oxidation on Pt(332), where the step 
density is 66-fold higher, supports the three-reaction mechanism. 

The derived pre-exponential factors and activation energies are 
shown in Table 1 and the temperature dependent reaction rate con- 
stants are shown in Fig. 3. The results obtained on the Pt(111) and the 
Pt(332) crystals agree well. The rate constants for CO oxidation at steps 
are similar to those for reaction at terraces. Despite this, the product 
yield through step reactions dominates at all temperatures used in this 
work and in previous studies (area to the left of the vertical line in 
Fig. 3). Even for a Pt(111) crystal with a step density of only 0.25%, 
reactions at steps dominate. Analysis with our kinetic model shows that 
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product channel is fitted to a Maxwell-Boltzmann function (translational 
temperature, Ttrans = 483 K; dashed blue line) and the hyperthermal 
channel is fit to a flowing Maxwell-Boltzmann function (Ttrans = 894 K, 

a = 190 meV; dashed red line). c, Velocity-resolved angular distributions: 
hyperthermal (red) and thermal (blue) channels. The hyperthermal 
channel is compared to a cos*@ function (red line) and the thermal channel 
is compared to a cosé function (blue line). a.u., arbitrary units. 


this reflects the fact that conversion of CO;*" to COS"? is much faster 
than the TT reaction in the temperature range of this and previous 
work. We therefore conclude that all previous kinetics studies were 
strongly influenced by step reactions (see Methods). 

For this reason, this is the first report to our knowledge of the rate 
parameters for the Pt(111) terrace reaction, ke o Only the rate of the 
TT reaction has been calculated from first principles!””°, as such, the 
new rate parameters provide the first opportunity to benchmark the ab 
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Fig. 2 | Kinetic traces of CO2 obtained from velocity-resolved kinetics 
experiments. The inset labelled Pt(111) shows an ion image with velocity- 
space integration windows for the hyperthermal (red; 1,280-1,610 m s-}) 
and the thermal (blue; 420-590 m s~') channels. The dashed black line 
represents the measured CO dosing function; the onset of the incident 
CO beam is taken as the zero of time. The solid red and blue lines are 

the fits resulting from the kinetic model (see Methods). The two traces 
are normalized to reflect the experimental time-integrated branching 
ratio between the two channels (8.6:1 in favour of the thermal channel) 

at this temperature and oxygen coverage. The step density was 0.0025 
monolayers, the oxygen coverage was 0.04 monolayers and the dose of 

a single CO pulse was 2 x 10-° monolayers. The surface temperature 

was 593 K. The experimental conditions for the (332) image are: oxygen 
coverage, 0.04 monolayers; surface temperature, 593 K; step density, 0.167 
monolayers. Here, oxygen atoms are expected to be entirely at step sites, 
because O, dissociates more easily at steps than at terraces”? and oxygen 
atoms are bound more strongly at steps than at terraces”. For the reaction 
on the (332) sample, the hyperthermal channel is absent. 
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Table 1 | Rate parameters for the elementary processes involved in the site-specific oxidation of carbon monoxide ona platinum catalyst 


Elementary process A(s-}) E, (eV) 
ko Desorption from terraces* 5.9°33 x 1078 1.28 + 0.02 
kg Desorption from steps? 1.5733 x 10! 1.18 + 0.056 
kJS 111 (332) Diffusion from terrace to step® 2.1713 x 10° (1.5 x 10°) 0.30 + 0.1 (0.32 + 0.1) 
kj! 111 (332) COterr + Oterr reaction 3.5735 x 10° (3.8°33 x 10%) 0.6+0.1 (0.6+0.1) 
Theory!? 5.1 x 1012 0.74 
k3$ 111 (332) COterr + Ostep reaction 5.9*38 x 107 (4.1785 x 10’) 0.40 + 0.1 (0.40 + 0.1) 
k3$ 111 (332) COstep + Ostep reaction 2.973% x 10° (2.673 x 10°) 0.65 + 0.1 (0.65 + 0.1) 
The rate parameters reported here should be used with caution outside the temperature range used in this work. 
@Measured independently, as published in ref. ©. This value also compares well with previous reports. See ref. 27: A=3.5132 x 10!35-1 E, = 1.27 + 0.07 eV. 


Obtained from measurements of desorption of CO from Pt(332). Compares well with rate constants reported in ref. 2’. 
°An activation energy of 0.3 eV was chosen following ref. 28. This should be considered an effective conversion rate. 


initio rates of this important catalytic reaction. Figure 3 shows one such 
calculation of the rate constants of the TT reaction, for which DFT is 
used to generate input for a transition-state-theory calculation of the 
rate constant!*. The theoretical rate constant is 20-100 times larger than 
that seen experimentally. The theoretically calculated activation energy 
is higher than that derived from experiment (Table 1), as is expected 
when using a revised Perdew-Burke-Ernzerhof (RPBE) functional?!. 
Improving the calculation would lower the barrier; however, this would 
only amplify the disagreement with experiment. Thus, it is clear that 
the theory fails to reproduce the prefactor of the rate constant. This can 
be the result of an incorrect treatment of reactant entropy” and/or 
dynamical recrossing at the transition state. 

Velocity-resolved kinetics provides a powerful tool for catalytic 
kinetic studies. Previous studies have attempted to obtain CO oxida- 
tion kinetics without velocity resolution by measuring phase shifts with 
modulated reactant molecular beams””’. To analyse these phase-lag 
data, the kinetic trace was assumed to follow a single exponential 


decay’”?3 ; this was effectively a single-reaction assumption that resulted 
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Fig. 3 | Rate constants for CO oxidation on platinum steps and terraces. 
Three reactions are identified in this work: the reaction of CO,°" with 
terrace-bound oxygen atoms (TT) and two reactions of step-bound oxygen 
atoms with COS“? (SS) and CO!*" (TS). Results obtained from a Pt(111) 
surface with a step density of 0.25% and those obtained from a Pt(332) 
crystal with a step density of 16.7% are shown as black and red lines, 
respectively. The units of the rate constant are those used when oxygen 
coverage is given as a (unitless) fraction of a monolayer. The grey line 
shows the results of transition-state theory (TST) calculations of the TT 
reaction rate based on a DFT calculation of the transition state’. It 
considerably overestimates reactivity. The region to the left of the vertical 
line shows the range of temperatures studied here and in previous studies. 
The region to the right of this line shows the range of temperatures typical 
for industrial catalysis. 
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in an activation energy that was dependent on oxygen coverage’. 
However, we show in Methods that these were only apparent activa- 
tion energies that result from the temperature and oxygen-coverage- 
dependent influence of the step and terrace reactions. 

The phase-lag method is analogous to frequency domain kinetic 
methods that are used in biophysics for obtaining (exponential) fluo- 
rescence lifetimes”, and was the main method used before the advent 
of pulsed-laser-based pump-probe methods. In providing the kinetic 
trace directly, velocity-resolved kinetics serves the function of a pump- 
probe experiment for neutral matter. This enabled us to easily identify 
three elementary reactions that comprise a site-specific mechanism 
of CO oxidation on platinum. The activation energies for these three 
reactions are not dependent on oxygen coverage. 

The relative importance of different reacting geometries occurring 
at different active sites has been described previously in terms of a 
geometry probability-weighted activation energy”: E,” = E,, — kTInA,. 
Here A; is the abundance of the ith reacting geometry and E,; is its 
activation energy. The results of this work reveal that the values of A; 
depend on diffusion and desorption, an outcome that is also relevant 
to real-world catalysts that are often operated at high temperatures. In 
Extended Data Fig. 10 we show that the terrace reaction dominates on 
Pt(111) and Pt(332) at high temperatures, whereas at low temperatures 
the step reactions dominate. At high temperature, CO desorption 
becomes much faster than CO diffusion; as such, the site of CO adsorp- 
tion determines which reaction occurs. Because step edges are minority 
sites on both samples, adsorption (and therefore reaction) occurs 
primarily at terraces. Extended Data Fig. 10 also shows that, under 
high-temperature conditions in which CO desorption suppresses the 
importance of diffusion, the efficiency of the conversion from CO to 
CO) is also reduced. We suggest that this may be a general phenomenon: 
low-concentration, high-reactivity defects on surfaces will be important 
in low-temperature catalysis in which reactant diffusion is faster than 
desorption, whereas at high temperatures, the majority facet dominates 
the chemistry and the efficiency of the reaction decreases. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
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METHODS 


Experimental methods. The instrument used in this work has been described 
recently for the measurements of desorption rates of CO from Pt(111) and 
Pd(111)?°. A platinum crystal is first prepared by argon ion sputtering and anneal- 
ing. The O2 (normal incidence) and CO (20° from normal) pulsed molecular 
beams cross at the platinum surface at a selected repetition rate ratio (RRR).We 
used 7-bar backing pressure for the O2 beam with a nozzle opening time of 70 1s. 
This gave an O, flux of 1.1 x 10’! molecules per pulse. A 5% mixture of CO in 
He at a total backing pressure of 7 bar and a nozzle opening time of 25 1s gave a 
CO flux of 2.2 x 101° molecules per pulse. The diameter of both molecular beams 
at the surface was 3 mm. The O; beam was always fired at least 500 1s before the 
CO beam. The CO pulse initiates the reaction and is consumed before the next 
CO pulse arrives. A single CO pulse alters the oxygen coverage negligibly. CO2 
products were photoionized by a nine-photon non-resonant absorption process 
induced by the focused output (20 cm focal length lens) of a Coherent Astrella 
Ti:sapphire laser producing 1-5 mJ per pulse with a pulse duration of 100 fs at a 
wavelength of 800 nm. The ion camera system was time-gated at a controlled delay 
with respect to the ionizing laser pulse to restrict detection to m/z = 44 (CO"). 
The laser ionization pulse was synchronized to the CO dosing pulse; the delay 
between these two pulses is used to analyse the kinetics of the reaction and at each 
delay, ion images are recorded using modified DaVis software (LaVision). Angular 
distributions are obtained by physically scanning the focal point of the ionizing 
laser beam back and forth in front of the reacting surface while accumulating the 
ion image. For kinetics measurements, the laser position was fixed and reaction 
products were detected close to the surface normal. The RRR value corresponds 
to a specific oxygen coverage, which is measured in a separate titration experiment 
in which the O2 beam is suddenly turned off and the CO; signal is recorded until 
all oxygen is removed from the surface. This is calibrated by titration against an 
oxygen-saturated platinum surface. See Methods section “Total oxygen coverage, 
[O,], from a titration experiment. 

Conversion of the time-dependent raw data to the kinetic trace. In measuring 
kinetics, the fundamental quantity of interest is the kinetic trace, defined as the 
product flux as a function of the reaction time. Of course, kinetic traces for reac- 
tant loss and transient populations of intermediates can also be of interest, but for 
simplicity we consider here only the appearance of the product. 

The oxidation of CO at platinum exemplifies the challenges in obtaining this 
quantity experimentally. As the reaction is initiated by CO adsorption, it is usual 
to use pulsed molecular beam methods. Time is required for CO oxidation to take 
place and for the product, CO:, to desorb from the surface. The time at which 
the product appears at a detector is then recorded with respect to the reaction- 
initiating CO pulse. Hence, the time dependence of the raw signal obtained in this 
experiment is composed of three contributions: the time required for the CO pulse 
to travel to the surface, the reaction time at the surface, and the time required for 
the products to travel to the ionizing spot created by the laser. Only the second 
time period is of interest in a kinetics experiment. The first is identical for the two 
reaction channels and therefore only relevant to the determination of the time at 
which the reaction is initiated. Finally, the third time period is different for each 
of the two reaction channels and must be measured. 

In velocity-resolved kinetics we measure ion images such as those shown in 
Extended Data Fig. 1a for each time delay between the CO and ionizing laser 
pulses. Here, the signal intensity is proportional to the CO) number density; 
as such, the hyperthermal channel (red box) is only weakly seen. 

Multiplying the intensity by the velocity converts the density image to a CO2 
product flux image (Extended Data Fig. 1b). We then select velocity integration 
windows (green/blue and red rectangles in Extended Data Fig. 1) and record the 
average intensity versus the delay between the CO pulse and the ionizing laser 
pulse, producing raw kinetic traces. Extended Data Fig. 1c, d shows how this 
appears as a density signal and as flux, respectively. 

Extended Data Fig. 2a shows the velocity distribution of the product CO:, and 
can be decomposed into two contributions: thermal (blue) and hyperthermal (red). 
The velocity windows used for integration in Extended Data Fig. 1 are also shown 
(blue and red hatched areas). The kinetic traces are then scaled to represent the 
actual area under the fit to the velocity distribution (blue and red shaded areas). 

A measurement of the angular distribution (Extended Data Fig. 2b, c) shows 
that the thermal CO, follows a cosine distribution, whereas the hyperthermal CO 
follows a cos* distribution, in agreement with previous reports!! 1371, 

The velocity distribution in Extended Data Fig. 2a includes only ions observed 
within 3° of the surface normal. In order for the kinetic traces to represent the 
total flux over all angles, we must scale up the kinetic trace of the thermal channel 
by an angular factor: 


f - cos*@sinddé f, a cosOsinOdé 
0 . Jo = 


3 ; o 4.7 
f ° cos*sinddd ia *° cosbsinodd 
0 0 


Note that in Extended Data Fig. 1d, the signal of the hyperthermal channel appears 
earlier than that of the thermal channel. This is a consequence of the reduced flight 
time of hyperthermal CO) to the ionizing laser spot, relative to that of the thermal 
CO . By subtracting the flight time using the average velocity associated with the 
integration window of each channel (12 and 35 1s, respectively), we can correct 
this. See Extended Data Fig. 2d, in which we have also shifted the zero of time to 
the onset of the CO adsorption. The temporal shape of the CO dosing pulse is 
measured as described in ref. 7”. 

Effective lifetimes as a function of oxygen coverage and step-dependent 
desorption. We perform a preliminary and approximate data analysis at low 
oxygen coverages, deriving effective lifetimes for each reaction channel, which 
leads to insights into the reaction mechanism. Extended Data Fig. 3a shows an 
example of the analysis. Here, the effective lifetime of the two reaction channels is 
derived through a simple exponential fit to the kinetic traces, convoluted over the 
time profile of the pulsed molecular beam. 

The lifetimes obtained from the fits are effective values, Ter = 1/Keg, which 
depend on the total oxygen coverage [O,]. The CO doses here are so small (less 
than 0.1% of a monolayer) that [O,] is independent of time during all transients. 
Referring to Extended Data Fig. 3b, it can be shown that keg = ko + k)[O,], where 
ky is the rate constant of the oxidation reaction and ky is a non-reactive competitive 
process removing the reactant CO, for example CO desorption. 

With this mechanism, a simple set of differential equations can be solved ana- 
lytically. This enables us to define the effective lifetime: 


d[CO,] _ 


a ko[CO,] + k,[CO,][O,] = [CO,] (ko + k2[0,]) = [CO,] kote 


[CO,] ,= [CO] ,e 


Flux(CO,) = k,[0,] [CO,] , = ky [0,] [CO,] ,e*«" 


By varying the repetition rates of the two pulsed beams, we vary the fluxes of CO 
and QO, and thereby the coverage of oxygen. Note that the use of repetition rate 
gives us very precise control of the fluxes. Methods section “Total oxygen coverage, 
[O,], from a titration experiment’ shows how the oxygen coverage is derived. In 
this way, we can observe a linear dependence between ke and the O, flux at low 
O> fluxes. Extended Data Fig. 3c shows an example of how k{f°"™" depends on the 
oxygen flux at several surface temperatures. 

For each temperature, we extract ki?" from the zero-coverage intercept of 
the linear fit. These are shown as filled circles in Extended Data Fig. 4, where they 
are compared to literature rate constants for CO desorption from Pt(111) steps 
(plus signs and filled circles) and terraces (triangles). The values of fehermal derived 
here are in excellent agreement with desorption rate constants for CO adsorbed at 
steps. 

Similar analysis for the hyperthermal channel on Pt(332) also reveals a non- 

reactive competitive channel that removes CO, kh yperthermal | Which is consistent 
with CO diffusion across terraces together with desorption. 
DFT calculation of minimum energy path for the terrace reaction. We per- 
formed DFT calculations within the generalized gradient approximation using the 
aimsChain tool within the FHI-aims package* to map out the minimum energy 
path of the CO oxidation reaction on Pt(111) terraces. The calculation used the 
RPBE functional in the spin-unpolarized (restricted) formalism with van der Waals 
corrections**. Other details of the calculations are as follows: A p(3 x 3) cell with 
4 layers with an optimized lattice constant of 4.00 A was used as a model for the 
Pt(111) surface. To find the initial and final state for the minimum energy path 
(MEP) calculation, we carried out geometry optimizations in which just CO and 
oxygen atoms were allowed to move and all surface atoms were restricted to their 
equilibrium positions. The optimizations were stopped when the force on every 
atom was <0.01 eV A~’. The minimum energy configurations found in this way 
were also checked against tight basis sets. For both MEP and geometry optimi- 
zations, the reciprocal space was sampled with a4 x 4 x 1 k-point grid, and to 
account for the electron occupation a Gaussian function with a width of 0.2 eV 
was used. The MEP calculations were performed using the string method* with 
light basis sets. To sample the MEP we have taken 15 images. The convergence of 
the pathway was reached when the forces between every image were smaller than 
0.15 eV A-!. To determine the transition state more accurately, we used a force 
criterion of 0.05 eV A“! for the climbing image. 

Extended Data Fig. 5 shows that the reaction is highly exoergic and passes over 
a substantial reaction barrier. The structure of the transition state is shown in 
the inset and is still far from the structure of the product, similar to that found in 
previous calculations!*””, This so-called ‘early barrier’ indicates that a substantial 
fraction of the reaction energy will be channelled into vibrational excitation of 
the product, CO. 
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We performed ab initio molecular dynamics calculations using the system and 
functional defined above, starting at the transition state determined from the MEP 
calculations to determine the translational energy of the product CO . The results 
from about 10 trajectories show that the values of the final translational energy 
are between 660 meV and 715 meV for the flexible slab at T = 0 K, and between 
720 meV and 920 meV for the rigid slab. The results of this electronically adiabatic 
calculation are consistent with those of experiments, in which molecular energy 
may also be shared with the electron-hole pairs of the metal. 

Kinetic modelling of the velocity-resolved data for CO oxidation on Pt(111). 
Reaction scheme. The kinetic model is based on the following reaction scheme for 
CO oxidation: 


COgeam ae Sstep =2, COstep Step adsorption (1) 
SCO . 
COgeam + Sterr —? COrerr Terrace adsorption (2) 
k's 
COrer + Sgtep —F COgtep + Sere Diffusion (3) 
ke 
CO pep — COg + Serr Terrace desorption (4) 
ko 
COgtep —* CO, + Ssiep Step desorption (5) 
k Tr ; 
COF er + OF err 4 cop? ertncemel, 2Srerr TT reaction (6) 
kgs thermal 
CO rer: + Ogtep — co; + Ste + Sgep TS reaction (7) 
CO ge, +O gen 2 COM! +258 SS reacti (8) 
Step Step 2g Step reaction 


where Sterrstep are the free sites expressed in monolayers. Because the step density 
is less than 1% on our Pt(111) sample, we omit reaction (1) and consider only 
CO adsorption at terraces. We include reaction (1) for completeness and it will 
be important for our treatment of Pt(332). The diffusion from steps back to ter- 
races has been omitted; as we see different temporal behaviour of the two kinetic 
traces in Extended Data Fig. 1, we conclude that the two CO species are not in fast 
equilibrium with each other. The diffusion from steps back onto the terrace must 
therefore be slow compared to the other processes. 
The rate equations describing this mechanism are as follows: 


[Ostep] 


Step 


A[Oseep] 
dt 


= —(kz° [COfer] + 3° [COstep]) 


7 Fo | [COs] + dr [CO rer] 


Astep— [Octep] _ [COgtep] 
d. 


d[COgep] = ke k3s [Ogtep] 
Step 


Step 


MCOrer) _ _| pr 4 grs@step Osten] (COs) rr [Oren 
a Astep Aver 
fe) 
+a! step | [CO rer] 
Step 

[Ostep] [Ospen] 
[Flux(COng™)| = kz [CO pee] + kz! [COsep] 
Step Step 

[Elux(CO!Pe**™)| — KIT [CO pag] tel 

Terr 


where the square brackets refer to concentration in monolayers of surface-adsorbed 
reactants and products with respect to the entire surface, dstep and dye;, are the 
concentrations of the step and terrace sites expressed in monolayers with respect 
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to the entire surface that is, dstep + dterr = 1 monolayer, |Flux(CO?,.)| refers to the 
absolute value of the flux of CO2.. gas phase product molecules. All rate constants 
kare in units of s |. 

We stress here that when probing the gas-phase product from a surface reaction, 
the flux of this species must be measured and not the rate of change of the product 
density, d[CO>,.]/dt. A simple dimensional analysis of the rate law clarifies this point: 


[CO nen [Over] hopaaue monolayers ae pens 
Bre, s cm’ s 
while 
d[CO 
ACO, 5] 2 has units of PA@ric'®s — ies 
dt cm’ s 


Flux has, by definition, units of particles cm? s~. 


The physical picture that further confirms this is the fact that as the particles 
desorb from a two-dimensional surface area, after some time they occupy a certain 
volume. This volume is proportional to the velocity. Measuring simply the density 
of the particles is insufficient, as high-speed and low-speed particles will have 
‘expanded’ into different volumes. 

Ion imaging can therefore function as a flux detector and, as such, is ideally 

suited for the measurement of kinetics at surfaces that result in gas-phase products. 
Numerical solution to the kinetic model. We derive a system of ordinary differential 
equations for the reaction scheme above and solve them numerically. The initial 
values for the concentration of oxygen, that is, [Oye] and [Osiep], are provided 
as an initial guess from the titration data (see Methods section ‘Validation of [Oterr] 
and [Osiep] from the numerical solution’). The concentration of free sites, [Sterr] and 
[Sstep], are expressed in units of monolayers, that is, [Sper] = 0.9975 monolayers 
and [Sterr] = 0.0025 monolayers for an empty surface and a step density of 0.25% 
as determined from AFM measurements. The rate constants for desorption from 
steps and terraces are known from previous experiments, and the rate constant for 
diffusion is estimated from literature values. An initial guess for the rate constants 
of the reaction is provided from an analysis of ker (see Methods section ‘Effective 
lifetimes as a function of oxygen coverage and step-dependent desorptiom). The 
adsorption of CO from the incoming beam is then introduced as a perturbation 
to this system. 
Optimization process. The flux of CO} into the thermal and hyperthermal channels 
as a function of time was then obtained by solving the ordinary-differential-equation 
system using LSODA from the FORTRAN77 library ODEPACK™. This flux was 
compared to the experimental data and the residuals were minimized in a recursive 
optimization. We used four global parameters, k Ska yk, andk3%, and two local 
parameters per dataset, the concentration of [Oger] and [Ostep], to perform 
optimization with the Nelder-Mead method. Following ref. *°, [Ogtep] can occupy 
every second (fcc) platinum-step site, therefore the maximum of [Ostep] was 
restricted to <0.5ds. 

Using these four global parameters, the kinetic model was able to properly 
reproduce 18 sets of thermal and hyperthermal flux at 7 temperatures, a total of 
252 kinetic traces. Examples are shown in Extended Data Fig. 6. 

Total oxygen coverage, [O,], from a titration experiment. The total amount of oxygen 
on the surface, [O,], could be determined in a titration experiment. First, a surface 
fully covered with oxygen (0.25 monolayers) was continuously dosed with CO 
and the total CO yield was measured. Next, the CO and O2 beams were run at a 
value of RRR used when measuring kinetic traces; this established a steady-state 
oxygen coverage. After several minutes, the O2 beam was suddenly turned off and 
the total CO yield was again measured. This CO; yield was then compared to 
the yield obtained from a fully covered surface, allowing us to relate RRR to the 
resulting steady-state oxygen coverage. The result is shown as black squares in 
Extended Data Fig. 7a. 

Validation of [Otter] and [Ostep] from the numerical solution. To test whether the 
values for [O7e:r] and [Ostep] obtained from the numerical model are reasonable, we 
compare their sum to the total amount of oxygen on the surface as obtained from 
the titration experiments just described. The kinetic model result for [O,] is plotted 
as red dots in Extended Data Fig. 7a. The good agreement between the prediction 
of the numerical solution and the titration determined values of oxygen coverage 
validates the way in which the numerical model deals with the oxygen coverage. 

In addition to this, we want to confirm that the partitioning of [O,] into [Oterr] 
and [Ostep] is reasonable. The equilibrium between the two oxygen species is cal- 
culated using the canonical partition function for oxygen atoms distributed among 
step and terrace sites using a given energy difference as described in Methods 
section ‘Equilibrium calculation of oxygen-atom populations at steps and terraces. 
The step coverage is defined as 


[Ostep] 
0.5dstep 


Step — 
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with the step density dstep = 0.0025. As explained in Methods section ‘Optimization 
process, one oxygen atom occupies every second binding site, therefore the factor 
of 0.5 is used. 

A plot comparing the numerical results to the equilibrium expectation is shown 
in Extended Data Fig. 7b. Here the relative binding energy of oxygen at steps and 
terraces was varied between 0.05 and 0.16 eV. Theoretical calculations using DFT 
suggest that step oxygen is more strongly bound than terrace oxygen, by about 
0.26 eV”, 

Therefore, the data suggests that the equilibrium distribution of oxygen atoms 

between steps and terraces is not established. Given the slow diffusion speeds on 
Pt(111) at our temperatures and the large average terrace size of about 400 atoms, 
this is conceivable. The distribution on Pt(332), which has an average terrace length 
of only five atoms, appears closer to or at equilibrium (see Methods section ‘Kinetic 
studies of CO oxidation on Pt(332)’). 
Equilibrium calculation of oxygen-atom populations at steps and terraces. 
In developing the numerical model for the oxidation of CO on platinum, we 
must account for the partitioning of oxygen atoms between steps and terraces. 
Theoretical calculations using DFT suggest that step-bound oxygen is more 
strongly bound than terrace-bound oxygen (by about 0.26 eV*°7). One limiting 
approach to this problem is equilibrium partitioning. 

Whereas step-bound oxygen is favoured by energetic considerations, 
terrace-bound oxygen atoms are favoured by entropy. To describe this, we have 
used a simple expression for the equilibrium constant which can be derived from 
the canonical partition function of non-interacting oxygen atoms bound at two 
sites (steps and terraces): 


ng 
Q( E B) = ny Astep Mer —B ((ng—m) E},—E ) 
step» "Terr No» Lis —_ > a No—m e 
m=0 


Here, binomial coefficients are used to calculate the number of ways to distribute 
No oxygen atoms over n step sites (step) and n terrace sites (Merr). We take into 
account that saturated coverage of steps occurs at a 2:1 ratio of platinum to oxygen 
atoms, whereas saturation of terraces occurs at a 4:1 ratio. Ers is the energetic 
stability difference between step-bound oxygen atoms and terrace-bound oxygen 
atoms. If we use this equilibrium partitioning to characterize our experimentally 
derived results by allowing Ers to vary, we find a value that depends on step density. 
For Pt(111) we find Eys = 0.05 eV (see Methods section ‘Validation of [O7e;,] and 
[Ostep] from the numerical solution’), whereas for Pt(332) we find Ers = 0.12 eV 
(see Methods section ‘Kinetic studies of CO oxidation on Pt(332)’). We observe 
that the Ers for Pt(332) is closer to that obtained from the DFT calculations?°*’, 
which could suggest that the system on the (332) surface is closer to equilibrium. 
Kinetic studies of CO oxidation on Pt(332). The kinetic mechanism for CO 
oxidation on Pt(332) is identical to that described in Methods section ‘Reaction 
scheme’ for Pt(111); however, here the step density (0.167) is large. Therefore, 
we include both reactions (1) and (2) weighted by 0.167 and 0.833, respectively. 
Based on our success with the treatment of the Pt(111) data, we implemented the 
numerical model as follows. 

First, we used titration measurements as described in Methods section “Total 
oxygen coverage, [O,], from a titration experiment’ as a starting point for the 
determination of the total oxygen coverage as a function of the ratio of O2 to CO 
fluxes (Extended Data Fig. 7c). We then constructed limits (red shaded area in 
Extended Data Fig. 7c) within which the oxygen coverage could be varied in the 
numerical optimization to reflect the uncertainty of the titration measurements. 
For the initial conditions of the numerical optimization, we assume the partitioning 
of oxygen between steps and terraces is at equilibrium as discussed in Methods 
section ‘Validation of [Oqerr] and [Ostep] from the numerical solution. After the 
fitting of the data is complete, the partitioning of the oxygen atoms at steps and 
terraces has also been optimized and is shown as black triangles in Extended Data 
Fig. 7d. These are compared to equilibrium calculations assuming different values 
of the binding energy difference between oxygen atoms at steps and terraces. The 
results obtained from the numerical fit are similar to equilibrium values when the 
energetic preference for step-bound oxygen atoms was 0.10-0.14 eV. Compared 
to the results derived from the Pt(111) experiments, this value is much closer to 
the theoretically estimated difference in binding energy. This is probably due to 
much smaller terrace sizes leading to faster equilibration compared to Pt(111). 

Extended Data Fig. 8 shows examples of the quality of the fit obtained using the 
rate constants from Table 1. We emphasize that both reactions (7) and (8)—the 
terrace-step and step-step reactions, respectively—are necessary to account for 
the biexponential fall-off in the thermal channel. Furthermore, the magnitudes of 
the rate constants obtained from the Pt(111) and Pt(332) surfaces are in excellent 
agreement; see Fig. 3 and Table 1. 

We point out that the derived activation energies for the three reactions make 
intuitive sense. For early-barrier reactions, the relative energies of the transition 
states follow closely the relative energies of the reactants. For the TT reaction, our 


experimental activation energy is 0.6 + 0.1 eV. In considering the SS reaction, 
the step-bound oxygen atom is stabilized by about 0.15 eV (see Methods section 
‘Equilibrium calculation of oxygen-atom populations at steps and terraces’) and 
the step-bound CO is stabilized by 0.16-0.35 eV**°. Here there are a variety of 
experimental studies that give a range of results. For early-barrier reactions, the 
transition state is stabilized by a similar amount of energy as the reactants are sta- 
bilized. Therefore it is not surprising that the SS reaction has a similar activation 
energy (0.65 + 0.leV) to that of the TT reaction (0.6 + 0.1 eV). We speculate that 
the TS reaction and the SS reaction access the same transition state via two different 
reactant geometries. If true, the activation energy for the TS reaction would be 
lowered by the step-to-terrace relative binding energy of CO, which is 0.16-0.3 eV. 
This simple analysis would predict a lower activation energy for the TS reaction of 
anywhere between 0.49 and 0.25 eV, which is consistent with our experimentally 
derived value of 0.4 + 0.1 eV. 

Comparison of velocity-resolved kinetics to previous reported velocity-integrated 
kinetic results. Previous surface-kinetics experiments were not able to resolve 
product velocities. Furthermore, only a single oxidation reaction was assumed, 
enabling a pseudo-first order analysis as a function of the surface temperature 
Ts and [O,], from which activation energies were derived. Extended Data Fig. 9b 
shows reported activation energies versus [O,] for the reaction thought to result 
from CO oxidation at terraces (hollow circles and plus signs) from refs 7". 

We can use our velocity-resolved data to simulate these previous results. We 
integrate the kinetic trace over the product velocities and fit it as if pseudo-first- 
order kinetics applied (see Extended Data Fig. 9a). We do this for data at many 
temperatures and values of [O,]. The apparent activation energies obtained in 
this way are shown as filled circles in Extended Data Fig. 9b, and there is marked 
agreement with the results of previous work. 

We emphasize that the true oxidation kinetics involves three reactions, 
the activation energies of which are independent of [O,]. This exemplifies how 
velocity-resolved kinetics removes a layer of averaging that confounds accurate 
kinetic analysis. 

The interpretations of previous work suggested that the activation energy 

of a surface reaction could vary continuously with coverage, leading to a change 
of more than 0.5 eV owing to oxygen-atom occupation of neighbouring reaction 
sites at a distance of around 8 A or more. This is clearly not the case. It is notable 
that the assumption of the single-reaction mechanism leads to this erroneous 
conclusion. 
Temperature dependence of the site-specific mechanism. We used the numerical 
model with rate parameters that have been optimized to reproduce the velocity- 
resolved kinetics data to explore the behaviour of CO oxidation as a function of 
temperature and oxygen coverage. Extended Data Fig. 10a shows the CO oxida- 
tion efficiency (probability that an adsorbed CO molecule is converted to CO2). 
Extended Data Fig. 10b shows the relative fraction of the hyperthermal channel 
(TT reaction) with respect to the thermal channels (TS and SS). 

We note that the dominant role of step reactions between 500 and 700 K may 
appear inconsistent with Fig. 3, which shows ky" > k38, k3° in this temperature 
range. This points out how subtle the kinetics of CO oxidation can be. Although 
the TT reaction has a higher rate constant, it must compete with CO diffusion to 
steps, which is faster than the terrace reaction at these temperatures. Furthermore, 
CO is preferentially adsorbed at steps; as such, it does not so easily diffuse back to 
the terraces once it has arrived at a step. Although it is true that step reactions have 
lower rate constants than do terrace reactions, CO molecules that have arrived at 
steps may only either react or desorb. Because desorption is rather slow, the reac- 
tion dominates. Another factor that is especially important when the total oxygen 
coverage is low is that oxygen atoms are preferentially found at step sites, which 
obviously favours reaction at steps. 

Therefore, the relative importance of step and terrace reactions is strongly influ- 
enced by diffusion and desorption. Under conditions in which the TT reaction 
dominates, the reaction probability of conversion from CO to CO, is small. We 
explain this by observing that the TT reaction can only dominate when CO deso- 
rption is faster than CO diffusion. Under such conditions, desorption is also faster 
than the TT reaction. We note that this is consistent with previous observations, 
in which only the hyperthermal channel was seen®. We have carried out similar 
calculations for the Pt(332) catalysts with step densities of 16.7%, and the results 
are nearly identical. 

It is also interesting to note that many industrial catalysts are used in a temper- 

ature region in which the CO-CO), conversion efficiency is low. 
Sensitivity analysis. The three lower panels in Extended Data Fig. 10 show how 
the fit to the data (represented by the total fit residual ||) depends on the three 
activation energies derived from our numerical model. Each two-dimensional 
contour plot shows sensitivity and the potential correlation of fitting error between 
two fitting parameters. 

The total fit residual was obtained by summing over the array containing 
the residual at each data point for both channels under 152 different reaction 
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conditions (both temperature and oxygen coverage vary within the dataset). 
The Nelder-Mead optimization method used the same residual array as 
input. For each point within each contour plot, the activation energies were 
varied as indicated by the respective axis. The reported energy was varied 
by +0.2 eV in steps of 0.013 eV. The pre-exponential factors were adjusted to 
compensate for the effect of the change in activation energy in the middle of the 
temperature range of the experiments (320°C). In other words, when varying the 
activation energy, the rate constant at 320°C was kept constant by compensating 
the prefactor. 

We can make a number of observations from these three plots. First, the 
residual is more sensitive to changes of E,* than it is to the other two activation 
energies. This can be seen from the strong gradient along the horizontal direction 
in Extended Data Fig. 10c and the large curvature along the vertical axis in 
Extended Data Fig. 10d. Second, E.” (the vertical axis of Extended Data Fig. 10c) 
and ES (the horizontal axis of Extended Data Fig. 10d) are ineffective at compen- 
sating fit error when E, is shifted away from its optimal value. Hence both plots 
appear as long and narrow valleys aligned with one of the two axes. This reflects 
the small correlation between these fitting parameters; the slant in Extended Data 
Fig. 10d represents a weak correlation in fitting error between E,° and ES. 
Extended Data Fig. 10e shows the sensitivity of E)’ and ES and correlation error 
involved in the fitting. From this plot we can see that the reported activation 
energies indeed reflect the minimum residual and that the residuals depend 
approximately equally on E,' and E;°. The approximately circular form of this 
contour plot indicates that the two fitting degrees of freedom are approximately 
uncorrelated with one another and that an optimized fit is sensitive to both acti- 
vation energies. 

On the basis of this analysis, we suggest an uncertainty of 0.05-0.1 eV for 
E, 8 and an uncertainty of 0.1-0.2 eV for E" and ES’. 

Data availability. The data that support the findings of this study are available 
from the corresponding author upon reasonable request. 
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Extended Data Fig. 1 | Kinetics of CO, formation. a, b, Ion images of the 
product CO). a, Raw data after background subtraction, with the intensity 
of each pixel proportional to the density of CO. The pixel distance from 
the laser position (dashed red line) is proportional to the velocity, that 

is, velocity increases to the right. b, Data after multiplying each pixel by 

its corresponding velocity, with the resulting pixel intensity proportional 
to the flux and the hyperthermal channel then becoming more readily 
apparent. The red and green rectangles indicate typical velocity integration 
windows used to produce kinetic traces for the thermal and hyperthermal 
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channels. The vertical span of the boxes subtends +3° around the surface 
normal. This small angular range justifies using rectangular integration 
windows. For defining kinetic traces, we take the average intensity within 
the rectangles as the product flux normal to the surface. c, d, Raw kinetic 
trace data. c, Average density of the CO2 product integrated over the blue 
(thermal channel) and red (hyperthermal channel) rectangular areas of the 
ion image (inset). d, Same data converted to flux. This kinetic trace and 
the ion images above were measured at 350°C, a time-averaged CO flux of 


2.2 x 10 s~! cm ~* and a time-averaged O) flux of 1.1 x 10'° s~! cm. 
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Extended Data Fig. 2 | Branching between thermal and hyperthermal 
reaction channels. a, Product speed distribution. The branching ratio 
between the thermal and hyperthermal channels is obtained from this 
distribution, calculated from the ion image in Extended Data Fig. 1. 

The thermal product channel is fit to a Maxwell-Boltzmann function 
(Ttrans = 483 K; dashed blue line) and the hyperthermal channel is fit to 
a flowing Maxwell-Boltzmann function* (Trans = 894 K, a = 190 meV; 
dashed red line). The velocities used for extraction of the kinetic traces 
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are indicated by the hatched areas. b, The angular distribution for the ion 
image shown in c. c, The flux-corrected ion image that shows an angular 
distribution subtending +25° around the surface normal. The angular 
distribution was measured at a surface temperature of 400°C. d, The 
kinetic trace. We obtain the product flux as a function of reaction time for 
two channels with different speed and angular distributions (details are 
given in the text). It is now clear that the hyperthermal channel is much 
weaker than the thermal channel. 
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Extended Data Fig. 3 | Effective lifetime analysis. a, Kinetic traces ofthe _ lifetimes plotted against time-averaged O> flux. Solid circles are 


thermal and the hyperthermal channel fitted by a convolution of an experimental points for k!2’™", The y-intercept is non-zero owing to CO 
exponential decay over the incoming beam. b, Kinetic model for the desorption. In these measurements, the time-averaged CO flux was 
effective lifetime of the thermal channel of CO oxidation. c, Effective 2.2 x 10% 57! cm~?. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


T/K 


700 650 600 550 500 
10 
9 
8 
7 
- 6 
Oo 
en) 
® 
52) 
4 
= 
3 k,'s from thermal channel 
2 CTC (steps) 
VLAPC-HeD (terraces) 
1 VLAPC-MBPS (terraces) 
Golibrzuch et al. (steps) 
0 
A 
-2 
0.0014 0.0016 0.0018 0.0020 0.0022 
1/T 1K" 
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desorption of CO from steps; see ref. '*. The filled red circles show rate 
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Extended Data Fig. 5 | Points along the reaction coordinate of the 
terrace—terrace reaction. The transition state resembles reactants; it is 

a so-called ‘early-barrier’ reaction. Such reactions channel the energy 
released in the reaction primarily into product vibration. Experimentally 
we observe that approximately 20% of the barrier-height energy (0.38 eV) 
appears as product transitional energy in the hyperthermal channel. 

We take this as evidence that the hyperthermal reaction takes place on 
platinum terraces. 
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Extended Data Fig. 6 | Examples of the kinetic model fit to 
experimentally derived kinetic traces for Pt(111). Nine out of 18 (every 
second) kinetic traces at 340°C are shown. Similar fit quality was obtained 
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for 18 plots for seven temperatures between 290 and 350°C. Information 
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on the total oxygen coverage [O,], the fractional coverage on steps and 
terraces (or and Oos, respectively) and the time-averaged O flux is shown 
above each plot. The time-averaged CO flux was 2.2 x 10’? s~'cm~*. 
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Extended Data Fig. 7 | Oxygen coverage on platinum surfaces. 

a, A comparison of the values of total adsorbed oxygen [O,] obtained 
from titrations on Pt(111) with [O,] values obtained from the numerical 
solution. The black squares with estimated uncertainty (1 s.d.) are the 
total amount of oxygen on the surface, [O,], obtained from titration 
measurements. The red dots show [Oa] = [Oterr] + [Ostep] obtained 

from the numerical solution at 340°C. The time-averaged CO flux was 

2.2 x 10!2s~! cm~*. b, The fractional step coverage (80s) on Pt(111) from 
the numerical solution compared to the partition function simulation. The 
black squares are the result of the numerical fit shown in Extended Data 
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Fig. 6. All data are at 340°C. c, A comparison of values of [O,] obtained 
from titrations on Pt(332) with [O,] values obtained from the numerical 
kinetic model. Note that the titration results are found to be independent 
of surface temperature under our conditions. d, The fractional step 
coverage (0s) plotted against [O,] for Pt(332). The red lines are results 
from the partition function calculated for different binding energy 
differences. The black triangles are the results from the numerical analysis 
of the kinetic model. Two extreme cases—no oxygen-atom binding 
preference for steps (dashed) and large oxygen-atom binding preference 
for steps (solid)—for the partition function are shown as black lines. 
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Extended Data Fig. 8 | Examples of the kinetic model fit to 
experimentally derived kinetic traces for Pt(332). All three reactive 
contributions are shown. Here Ts = 320°C and the oxygen coverage varies 
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out and (at high temperatures) where industrial catalysts are used. 
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The East Antarctic Ice Sheet (EAIS) is the largest potential 
contributor to sea-level rise. However, efforts to predict the future 
evolution of the EAIS are hindered by uncertainty in how it 
responded to past warm periods, for example, during the Pliocene 
epoch (5.3 to 2.6 million years ago), when atmospheric carbon 
dioxide concentrations were last higher than 400 parts per million. 
Geological evidence indicates that some marine-based portions of 
the EAIS and the West Antarctic Ice Sheet retreated during parts of 
the Pliocene!”, but it remains unclear whether ice grounded above 
sea level also experienced retreat. This uncertainty persists because 
global sea-level estimates for the Pliocene have large uncertainties 
and cannot be used to rule out substantial terrestrial ice loss’, and 
also because direct geological evidence bearing on past ice retreat 
on land is lacking. Here we show that land-based sectors of the 
EAIS that drain into the Ross Sea have been stable throughout the 
past eight million years. We base this conclusion on the extremely 
low concentrations of cosmogenic !°Be and 7°Al isotopes found in 
quartz sand extracted from a land-proximal marine sediment core. 
This sediment had been eroded from the continent, and its low levels 
of cosmogenic nuclides indicate that it experienced only minimal 
exposure to cosmic radiation, suggesting that the sediment source 
regions were covered in ice. These findings indicate that atmospheric 
warming during the past eight million years was insufficient to cause 
widespread or long-lasting meltback of the EAIS margin onto land. 
We suggest that variations in Antarctic ice volume in response to the 
range of global temperatures experienced over this period—up to 2-3 
degrees Celsius above preindustrial temperatures‘, corresponding to 
future scenarios involving carbon dioxide concentrations of between 
400 and 500 parts per million—were instead driven mostly by the 
retreat of marine ice margins, in agreement with the latest models. 

The configuration of the EAIS during the Pliocene epoch has been 
a longstanding topic of debate (see ref. ’ and references therein). 
Although the marine 680 record (a proxy for global ice volume) is 
consistent with permanent EAIS establishment 14 million years ago 
(Ma), studies of Pliocene marine diatoms from glacial sediments high 
in the Transantarctic Mountains (TAMs) first raised the possibility that 
the ice sheet retreated substantially, resulting in an interior seaway, and 
then regrew as recently as 3 Ma. In apparent contradiction, geomorphic 
investigations identified ancient land surfaces in the TAMs, implying 
persistent polar desert conditions for the past several million years. 
This finding, together with evidence that the diatoms were not in situ 
but rather wind transported, led to doubts surrounding the scale of 
a late Neogene deglaciation. More recently, records from land- 
proximal marine sediments revealed sea surface temperatures that were 
much warmer than present, a lack of summer sea ice and the retreat 
of marine-based parts of the EAIS and the West Antarctic Ice Sheet 
(WAIS) numerous times during the Pliocene!*, 

Recent modelling may have reconciled these seemingly disparate 
observations, suggesting that the retreat of marine-based ice-sheet 


sectors during the Pliocene allowed winds to loft diatoms from 
isostatically uplifted marine sediments onto TAM tills that may in fact 
be much older®?. Atmospheric warming in these Pliocene simulations 
is insufficient to drive widespread surface melting and the retreat of 
land-terminating ice margins. Nonetheless, the resolution of the mod- 
els is coarse relative to the width of potential ablation zones on the 
steep flanks of the ice sheet, and observations show that even in today’s 
climate there is widespread surface melt as high as 1,300 m above sea 
level around Antarctica!°. Moreover, Pliocene sea-level reconstructions 
cannot exclude the possibility that the EAIS margin retreated onto land. 
Global estimates of sea level during Pliocene interglacials show a wide 
range—from 5 m to more than 40 m higher than today*'!—owing to 
persistent uncertainties associated with extracting the sea-level sig- 
nal from the marine '8O record and from palaeo-shorelines. While 
most of this sea-level rise could be accounted for by loss of the present 
Greenland Ice Sheet (7 m) and/or marine-based portions of the WAIS 
(3 m) and EAIS (19 m), the highest estimates would also require input 
from EAIS land-based sectors (which contain 34 m sea-level equiva- 
lent)!*. To date, however, there has been no direct evidence confirming 
that the EAIS margin retreated onto land during the past few million 
years. 

Here we test directly for retreat of the EAIS margin onto land and 
for consequent exposure of the land surface to cosmic rays over the 
past 8 million years (Myr) by measuring concentrations of the cos- 
mogenic nuclides '°Be and 7°Al in quartz sands from the AND-1B 
sediment core, retrieved from beneath the Ross Ice Shelf (Figs. la 
and 2c). While diatom-rich units in AND-1B were used previously 
to identify numerous open-water intervals! associated with collapse 
of the WAIS8, we focused instead on terrigenous sediments from the 
intervening glacial units (Fig. 2) sourced from the EAIS. Cosmogenic 
nuclides are suitable for evaluations of past terrestrial retreat of the 
EAIS, because they are diagnostic of subaerial land exposure; these 
nuclides do not form in measurable concentrations under ice sheets, 
which shield rock surfaces from cosmic radiation. Cosmogenic 
nuclide concentrations in terrestrial materials from glaciated regions 
are controlled by a combination of ice-sheet extent and erosional 
processes'*. Nuclide production is highest at the ground surface, 
decreases exponentially in the top few metres and then extends tens 
of metres deeper at much lower rates!*. Exposure drives nuclide con- 
centrations higher with time; however, nuclides are also lost owing 
to radioactive decay (the half-life of "Be is 1.39 Myr and that of 7°Al 
is 0.71 Myr) and erosion, which strips nuclide-rich surface material. 
Glaciation tends to favour both of these concentration-lowering pro- 
cesses because it halts nuclide production and, under areas of erosive 
ice, removes previously exposed material from the bed. The nuclide 
concentration preserved in sediments carried from land to an adja- 
cent ocean reflects a convolution of exposure and burial history along 
with erosion depth, integrated along ice-flow lines and weighted by 
erosion rate!’. 
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Fig. 1 | Locations of the AND-1B core in the Ross Sea and ice-flow 
lines to the core site, along with simulated ice-sheet retreat during the 
Pliocene. The location of the AND-1B core is marked with a star. 
a, Some areas upstream of the core site are below sea level at presen 
(light grey), but most would rise above sea level 10,000 years after 
deglaciation of marine-based ice-sheet sectors”* (dark grey) and be 
exposed to cosmic radiation. WB, Wilkes Basin; triangles, Transantarctic 
Mountains. The dotted lines and arrows show ice-flow lines. Figure 
modified from ref. !. b, Ice flows to the core site from the Skelton and 
Mulock outlet glaciers of the EAIS at present, and probably did so 
throughout the Plio-Pleistocene’®. Figure modified with permission from 
ref. '°. c, Simulated Pliocene ice-sheet configuration (solid line), taken 
from ref. °, with retreat restricted almost entirely to marine-based sectors 
of the ice sheet. The dashed line shows the present ice extent. 
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The AND-1B sediments that we measured probably represent a 
relatively large area well upstream of the core site. Sand provenance 
indicators suggest that AND-1B glaciogenic sediments were likely to 
have been sourced from the Skelton and Mulock Glacier region of the 
TAMs throughout the Plio-Pleistocene!®, and perhaps also the Wilkes 
Basin further inboard beneath the EAIS (Fig. 1). Indeed, Last Glacial 
Maximum tills further outboard in the Ross Sea appear to have been 
derived from the ice-sheet interior and from the entire width of the 
TAMs, which are traversed by outlet glaciers!®. Modelling suggests 
that outlet glaciers may have been more active during past warm peri- 
ods, sliding and eroding along much of their lengths'’, and erosive 
zones would have migrated inward as the ice-sheet margin retreated 
(Extended Data Fig. 1). The volcanic rocks in McMurdo Sound sur- 
rounding the core site do not contain quartz’, precluding a local con- 
tribution. These source areas would have accumulated cosmogenic 
nuclides if substantial ice loss had occurred in the past. The TAMs are 
above sea level and would experience exposure to cosmic rays during 
ice-free periods. The Wilkes Basin lies below sea level today”, but much 
of it would have rebounded above sea level had the ice sheet experi- 
enced major retreat (Fig. 1a and Extended Data Fig. 2). 

We evaluated whether AND-1B sediments have high concentrations 
of cosmogenic nuclides indicative of past ice retreat by using 62 diam- 
ictite samples from the top 786 metres of the core. These were amalga- 
mated and purified to obtain sufficient quartz sand for measurement, 
yielding eight samples (designated A-H, from top to bottom), which 
span core intervals of 0.2-260.7 metres in thickness, or 0.002 Myr to 
2.166 Myr in duration (Fig. 2; see also Methods and Supplementary 
Information). Procedural blanks extracted in addition to the samples 
quantify background nuclide abundances characteristic of sample pro- 
cessing and accelerator mass spectrometry analysis (see Methods and 
Supplementary Information). 

In general, AND-1B nuclide concentrations are only marginally 
higher than those of procedural blanks, particularly for *°Al (Fig. 3); 
thus, they are indicative of little, if any, near-surface exposure. We 
used several statistical techniques to evaluate the certainty with which 
sample measurements exceed analytical backgrounds. First, cumula- 
tive frequency distributions suggest that the sample population as a 
whole contains low but measurable concentrations of both '°Be and 
2©A] (Extended Data Fig. 3). Second, we used both frequentist and 
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Fig. 2 | AND-1B stratigraphy and cosmogenic nuclide samples. 

a, Lithostratigraphy of the AND-1B core”’ (m.b.s.f., metres below sea 
floor). b, Masses of sand (grains greater than 125 jm in size) in the 62 
samples used here (plus symbols; see also Supplementary Information). 
Quartz was extracted from these samples and amalgamated, yielding eight 
final samples (labelled A to H and shaded grey) used for measurement of 
cosmogenic nuclides. We note that there is a roughly 200-m gap between 
samples G and H. c, Age control points based on palaeomagnetic events 
(black squares), “°Ar/*’Ar ratios (maroon circles) and diatom data (green 
diamonds)”’. Arrows designate maximum or minimum limiting ages. 


Bayesian statistical approaches to consider whether the overall sample 
population as well as individual samples contain cosmogenic nuclides 
at concentrations above background (Methods; Extended Data Tables 1 
and 2). We found that all but the oldest sample contain more “Be than 
process blanks, and six of eight samples (B, C, E, K G and H) contain 
?6 A] above blank level. We corrected sample nuclide concentrations for 
background using blank values, and for decay on the sea floor using the 
core-age model (see Methods). Although the ?641/!°Be ratio can be used 
to monitor burial duration after exposure—because the faster decay of 
?6Al relative to !°Be causes the ratio to decrease from its production 
value’4—the relatively large uncertainties of the low-concentration 
AND-1B data preclude resolution of meaningful decay-corrected 
26 \1/!Be ratios. 

The presence of !°Be and 7°Al in AND-1B sediments provides une- 
quivocal evidence for past landscape exposure in source regions. The 
relatively short half-life of 7°Al requires that this exposure occurred 
after EAIS expansion at 14 Ma, otherwise none would remain today 
(Extended Data Fig. 4). However, the decay-corrected nuclide 
concentrations are extremely low, declining from only about 12,000 
to 600 '“Be atoms per gram, and from 120,000 to 6,000 *°Al atoms 
per gram, over the Plio-Pleistocene (Fig. 4a and Extended Data 
Fig. 4)—the equivalent of only 150-3,000 years of surface expo- 
sure at sea level (where the '°Be production rate is about four atoms 
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Fig. 3 | Cosmogenic nuclide abundances. a, b, Probability distribution 
functions showing the measured abundance (total number of atoms) of 
10Be (a) and 7°Al (b) in AND-1B samples (coloured lines). Also included 
are all laboratory process blanks run alongside low-level samples in the 
same hood by the same operator (shown as translucent grey). Labels A-H 
indicate samples from top to bottom in the core, and asterisks denote 
samples that have nuclide abundances above blank level at 90% confidence 
according to Bayesian analysis (see Methods). Note that the x axes in the 
two panels have different scales. 


per gram per year), and even less at higher elevations. It is possible 
that these small concentrations of nuclides result from incorpora- 
tion of minor amounts of material derived from nunataks in the 
TAMs’®. In any case, these low concentrations are evidence that any 
Pliocene land-based ice-sheet retreat upstream of AND-1B was at 
most very limited in duration or extent. Several arguments support 
this interpretation. 

First, modelling suggests that in most plausible scenarios, the 
AND- 1B nuclide concentrations are consistent with spatially or tem- 
porally limited ice retreat during glacial cycles of the Pliocene and 
especially the Pleistocene (Methods); repeated exposures, even if rel- 
atively brief, would have led to higher concentrations than we meas- 
ured (Extended Data Fig. 5). A single, long-lasting Pliocene exposure 
event—perhaps during the warm, roughly 3.6-3.4-Ma interval implied 
by a 60-m-thick diatomite in the AND-1B core'—could have yielded 
‘Be concentrations consistent with our Pliocene data if erosion 
rates were at least tens of metres per million years once the ice cover 
returned. However, without continued exposure, such high erosion 
rates cause '"Be concentrations to decline more steeply over time than 
observed in our record (Extended Data Figs. 6 and 7), and frequent 
exposure seems unlikely given that temperatures and sea level were 
rarely higher than at present during the Pleistocene'*”°. The decreas- 
ing trend in AND-1B nuclide concentrations instead nearly follows a 
decay curve (Fig. 4a)—a trend that is most consistent with low rates 
of bedrock erosion under continuous ice cover since the last episodes 
of exposure. 

Second, the persistent ice cover suggested by our record is also con- 
sistent with the perennially frozen conditions that some have argued 
are needed to explain the extreme landscape stability in the TAMs 
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Fig. 4 | AND-1B 'Be record. a, Decay-corrected '°Be concentrations from 
the AND-1B core. Shading shows 1o uncertainty. The vertical green bar 
near 8 Ma represents the possible range of decay-corrected concentrations 
in the oldest AND-1B sample that was below detection limit at the time of 
measurement (see Methods). The dashed black line shows the decay curve 
that would end at the youngest AND-1B sample. b, Benthic 6'8O record, a 
proxy for global ice volume and deep ocean temperature”®. c, Atmospheric 
CO) concentrations from ice cores”’ (purple line towards the right), and 
from palaeosol 8'3C (red), alkenone §!3C (yellow), stomata (green) and 
marine 6'!B (blue), with 1a uncertainties*”. 


over the past few million years, which is reflected in high cosmogenic 
nuclide concentrations in exposed surfaces and intact surficial ash 
deposits’. 

Finally, relative stability of the EAIS is evident when the AND-1B 
cosmogenic marine sediment record is compared with similar records 
from other ice sheets. Decay-corrected AND-1B !°Be concentrations 
are several times lower than values for Greenlandic sediment over the 
same time period! (Plio-Pleistocene averages of 7,000 and 25,000 
atoms per gram, respectively), which are in turn several times lower 
than values for tills from the Laurentide Ice Sheet”? (Pleistocene 
average of 70,000 atoms per gram). These patterns are consistent with 
repeated North American interglacial exposure sustaining higher 
nuclide concentrations, a generally present but dynamic Greenland 
Ice Sheet, and persistent Antarctic ice cover maintaining low 10Be 
concentrations. 

How well does AND- 1B represent the broader EAIS? Models simu- 
lating EAIS deglaciation suggest that ice may linger in the higher ele- 
vations of the TAMs, but all show the ice sheet generally contracting 
inwards, with considerable land area that is currently upstream of the 
core site being exposed relatively early during retreat??4. Furthermore, 
these models suggest that warming consistent with even the most 
intense interglacials of the past 8 Myr would have produced minimal 
land exposure around the entire EAIS margin®*. The near absence of 
cosmogenic nuclides in our record therefore rules out substantial and 
long-lasting EAIS retreat onto land during the past 8 Myr—a finding 
that is consistent with recent seismic imaging off the Aurora Basin”®, 
and which could be confirmed by similar cosmogenic nuclide meas- 
urements in cores elsewhere around Antarctica. Our results also put 
an upper limit on estimates of Pliocene sea level, because melting 
of all marine-based ice in Antarctica!” and the Greenland Ice Sheet 
could contribute at most 30 m of sea-level rise. Together with prior 
evidence for open waters at the AND- 1B site during the Pliocene!, our 
findings agree with model simulations which show that the terrestrial 
EAIS experiences minimal melt when carbon dioxide levels are at their 
present value of roughly 400 parts per million for extended periods of 
time, whereas some marine-based ice-sheet sectors largely disappear>”* 
(Figs. 1c and 4c). 
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METHODS 

Sediment samples. We prepared a total of 62 samples, ranging in thickness from 2 cm 
to 70 cm, from the top 786 m of the AND-1B core. The top 61 samples are from 
Facies 10, consisting of diamicton interpreted as subglacial sediments, although 
possibly also associated with rainout from floating ice and mass flows*!. The lowest 
sample is from Facies 4, interpreted as hemipelagic muds and clasts rained out from 
icebergs or an ice shelf*!. The samples were disaggregated by soaking in weak acid 
and then wet-sieved; then, because of their small masses, fractions of greater than 
125 1m from adjacent samples were amalgamated until they totalled around 200 g, 
yielding 14 samples. Quartz was isolated through an initial heated hydrochloric 
acid ultrasonic etching, followed by repeated weak (0.25%) hydrofluoric acid/nitric 
acid etching; the fraction that was greater than 1,000 j1m was sieved to remove 
larger grains freed up during acid disaggregation. Because some of the resulting 
quartz separates were too small for !°Be measurement, we amalgamated adjacent 
samples once again, producing eight final samples with quartz masses of 14-20 g, 
spanning core intervals of 0.2-260.7 m in thickness, or 0.002 Myr to 1.763 Myr 
in duration. Sample processing details are given in Supplementary Information. 
Cosmogenic nuclide extraction. }°Be and 7°Al were extracted from quartz at the 
University of Vermont, following the methods of ref. °°. We added around 250 pg 
of *Be to each sample using a beryl carrier made at the University of Vermont. 
The total ?”Al was quantified using inductively coupled plasma optical emission 
spectrometry analysis of replicate aliquots removed from samples directly after 
digestion; measurements were made using an internal standard (Y) and two differ- 
ent Al emission lines. The!’Be/*Be ratios were measured at the Lawrence Livermore 
National Laboratory and normalized to primary standard 07KNSTD3110, assum- 
ing a !°Be/*Be ratio of 2.85 x 10~!? (ref. 93). 7°Al/*’Al ratios were measured at 
the Purdue Rare Isotope Measurement Laboratory and normalized to KNSTD 
standard 26AI-1-05-2, with an assumed 7°Al/?’Al ratio of 1.82 x 1071? (ref. 34). 
Isotopic data are given in Supplementary Information. 

Blanks. Two procedural blanks were prepared with the samples during nuclide 
extraction to estimate the background nuclide abundances resulting from both 
laboratory sample processing and accelerator mass spectrometry (AMS) analysis. 
To characterize more fully the possible range of backgrounds, we also considered 
all blanks run by the same operator in the same fume hood (dedicated exclusively 
to the preparation of low-ratio samples) with other long-buried samples from 
beneath the Greenland Ice Sheet (n = 26 for !°Be; n = 18 for 7°Al). We quantified 
the central tendency and dispersion of the sample and blank populations using 
their arithmetic means and standard errors. All blanks were measured at the same 
AMS facilities as the samples (Lawrence Livermore National Laboratory for '°Be, 
Purdue Rare Isotope Measurement Laboratory for Al). 

Statistical analysis. Traditionally, frequentist methods (that is, null-hypothesis 
significance tests; for example, the t-test) are applied to sample results and process 
blanks in order to determine whether the two groups are statistically different or if 
individual samples are statistically distinct from a group of blanks. A null hypoth- 
esis is framed (for example, ‘the true mean of the samples is less than or equal to 
the true mean of the blanks’) and is rejected for P values less than the significance 
level (a). However, under a frequentist approach, one cannot express support for 
the research hypothesis (for example, that a given sample result is greater than 
the mean of the blanks). A low P value simply means there is a low chance that 
our data would support the null hypothesis simply by random chance’. In other 
words, the probability of incorrectly rejecting the null (type I error) is low. On the 
other hand, a Bayesian analysis can support either a null or a research hypothesis, 
provide explicit information about the precision of parameter estimations in the 
form of credible intervals, and allow data to be examined in the context of the effect 
size*>->”, Bayesian frameworks have the added advantage of allowing for non- 
normal distribution of datasets**, which can be a common occurrence when work- 
ing with low-level detection samples*’. Notably, our AND-1B datasets exhibit some 
non-normality. While !°Be blanks and samples and 7°Al blanks were each from a 
normal distribution, as confirmed by the Shapiro- Wilks test (at a significance level, 
a, of 0.05), the group of ?6A] blanks was not normally distributed. 

We applied Bayesian inference to estimate distribution parameters (including 
the mean and standard deviation) for our AND-1B datasets, and to compare 
parameters between the samples and blanks. In the context of the t-test, our 
response variables (!°Be and 7°Al reported in the samples and the blanks) were 
modelled as having a t-distribution to accommodate potential outliers. The 
t-distribution is similar to a normal distribution, parameterized by the mean (1) 
and the standard deviation (c), but also includes a third parameter (v) that accom- 
modates heavy-tailed distributions (v < 30) and near-normal distributions (v = 30). 

Model parameters included (1, j2 01, 02 and v, with the subscripts denoting 
the identity of the blanks (group 1) and samples (group 2). Non-informative priors 
were assigned to each parameter. A normal distribution was assigned to j1; and ji, 
centred on the empirical mean of pooled data (both blank and sample groups), with 
a variance ranging from 0.2 to 5 times the standard deviation of the pooled data. 
Standard deviation parameters—o, and 0.—were each assigned a prior distributed 


as a gamma density with shape and rate parameters “derived from the desired 
mode and standard deviation of the gamma distribution”*°. Equal normality was 
assumed for both groups, with a prior for v distributed as a gamma density with 
shape and rate parameters “derived from the desired mean and standard deviation 
of the gamma distribution”. 

We used Gibbs sampling to obtain samples from the posterior distribution 
and to estimate the mean, standard deviation, standard error of the mean, mode, 
quantiles and credible intervals for each Bayesian model parameter. Markov-chain 
Monte Carlo sampling was implemented in R code"! using the ‘BEST’ package”, 
which relies on ‘JAGS" and ‘coda’* code. Sampling was conducted with three 
chains, for 100,000 iterations with a thinning factor of 1, after discarding the initial 
4,500 iterations for adaptation and burn-in phases. Convergence was confirmed 
by visual examination of trace plots and the Gelman—Ruben statistic** (that is, 
requiring a potential shrink-reduction factor of less than 1.1). 

We first compared the blanks with the samples for each nuclide (expressed in 
atoms), applying a one-sided, two-group t-test (2GTT) using both frequentist and 
Bayesian approaches. Notably, our blank results included all laboratory process 
blanks run alongside low-level samples in the same hood by the same operator 
over time (n = 26 for !°Be and n = 18 for 7°Al). 

A frequentist one-sided 2GTT (unequal variances, Welch approximation) deter- 
mined that the mean of the samples may be greater than the mean of the blanks 
for both !°Be and 7°Al. The null hypothesis (that the true mean of the samples 
is less than or equal to the true mean of the blanks) was rejected at a = 0.05 
(P < 0.05; Extended Data Table 1). 

A Bayesian, one-sided 2GTT yielded consistent results, but provided support 
for the research hypothesis (that is, that the sample mean is greater than the blank 
mean) and quantified the probability. Results indicate that the two groups (samples 
and blanks) are credibly different for both !°Be and °Al; moreover, there is a greater 
than 99% probability that the sample mean is greater than the blank mean for 
each nuclide (Extended Data Table 1). The 95% credible interval on the estimated 
difference between group means excludes zero for both !°Be and 7°Al. 

Next, we compared each individual sample with the mean of the blanks to deter- 
mine whether the samples contained credible quantities of !°Be and *°Al, using 
both frequentist and Bayesian methods. The two methods yielded contrasting 
outcomes, with the value of Bayesian over frequentist methods becoming especially 
apparent from this analysis. 

We performed the frequentist 1GTT to evaluate the null hypothesis that the 
sample result was less than or equal to the blank mean. For !“Be, we obtained a low 
P value (less than 0.10) for seven of the eight samples (A to G), rejecting the null 
hypothesis. However, this 1GTT failed to reject the null hypothesis for reported 
10Be in sample H. For 7°Al, a low P value (less than 0.10) resulted in a rejection 
of the null hypothesis for all eight samples. However, frequentist analysis does 
not permit us to express support for the research hypothesis—that is, that Al is 
credibly present in these eight samples, or that !°Be is credibly present in seven of 
the eight samples—only that there is a low probability of a type I error. 

Bayesian 1GTTs were carried out by defining a region of uncertainty (ROU) 
around the sample-specific estimate equal to +1o. Bayesian 1GTT results indi- 
cated that Be is credibly present above the mean blank value in seven of the eight 
samples (all but H). In each credible case, the blank mean is less than the sample 
value, and a 90% credibility interval on the posterior distribution of the mean of 
the blanks fully excludes the +1a ROU surrounding the sample value (Extended 
Data Fig. 8a). By contrast, for sample H, the mean of the blanks is greater than the 
reported sample value (Extended Data Fig. 8b). An estimated 92.9% of the poste- 
rior distribution is above the sample value, and 89.9% of the posterior distribution 
of the mean is located within the sample-specific ROU (Extended Data Table 2). 

In the case of 7°Al, Bayesian 1GTT results indicated that this constituent is 
credibly present above the mean blank value in samples B, C, E, E, G and H only. 
Samples A and D may be, to varying degrees, indistinguishable from the process 
blanks (Extended Data Table 2). Although the 90% credibility interval on the pos- 
terior distribution of the blank mean was less than, and fully excluded, the respec- 
tive sample value, a percentage of the posterior distribution (17.5% and 10.8%, 
respectively, for A and D) was located within the sample-specific ROU—including 
portions of the 90% credibility interval (Extended Data Fig. 8c). 

Thus, the Bayesian 1GTT results contrast with those of the frequentist null- 
hypothesis significance tests. Although significant P values (at a = 0.10) obtained 
in the frequentist 1GTT may have led one to conclude that '°Be is present in seven 
out of eight samples and that 7°Al is present in all eight samples with a ‘confidence’ 
interval of 90%, we can only state a low chance of having incorrectly rejected a true 
null hypothesis (that is, generated a ‘false positive’) for these samples. The Bayesian 
approach, however, allowed us to express support for the research hypothesis that 
10Be is credibly present above the mean of the blanks in samples A to G, and that 
26] is credibly present above background only in samples B, C, E, F, G and H. 

It is difficult to explain the presence of *°Al but the absence of !°Be in sample H 
given the shorter half-life of ?°Al. Because the 7°A] abundance in this sample barely 
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exceeds the threshold for statistical significance (Fig. 3b), we consider it dubious 
that this sample in fact contains measurable 7°Al, and disregard it. 

Blank correction. To estimate the concentration of nuclides in the samples, we 
subtracted the mean nuclide abundance of all the blanks. We quantified the blank 
uncertainty as the standard error of the mean, and combined it with the measure- 
ment uncertainty of the sample nuclide abundances in quadrature. 

Decay correction. Using the age model from ref. *”, we corrected measured cos- 
mogenic nuclide concentrations for radioactive decay on the seafloor following 
deposition; this age model is based on palaeomagnetic events, biostratigraphic data 
and “°Ar/*’Ar ages. Because measurements were made on amalgamated subsam- 
ples that span considerable depth (and thus age) ranges in the core, we used the 
average age of the subsamples, weighted by their sand (more than 125 jum) masses. 
Our cosmogenic record is relatively insensitive to age-model uncertainties because 
the age uncertainties are substantially smaller than the half-lives of !°Be and 7°Al 
(1.39 Myr, ref. 46. and 0.71 Myr, ref. a): 

Limiting initial nuclide concentrations of blank-level samples. We estimated 
the maximum concentrations that samples without measurable nuclides could 
have had when they were deposited and still decayed to blank levels by today. For 
example, our average 10Be blank has about 7,000 atoms. This implies that the oldest 
sample (H), which does not have measurable 10Be, could have had up to some 
380,000 atoms (or around 19,000 atoms per gram for this 20.1-g sample) when it 
was deposited 8 Ma (5.75 half-lives ago). 

Antarctic erosion rates. The AND-1B cosmogenic nuclide record is erosion 
weighted because sediments are sourced from areas in proportion to their ero- 
sion rate. In addition, areas of more rapid erosion contribute deeper-sourced, 
and thus nuclide-poorer, material, potentially diluting the cosmogenic nuclide 
concentration resulting from a past exposure event. Modelling, sediment back- 
stacking techniques, subglacial imaging, and thermochronometry suggest that 
while Antarctic subglacial erosion rates may have reached up to 200 m per Myr in 
spatially isolated regions in the past, spatial averages were typically much lower 
(1-2 m per Myr)**-°°, particularly during the late Cenozoic, when landscape 
dissection probably slowed with colder conditions and less aggressive glacial 
erosion*?3}>?, 

Modelling hypothetical exposure scenarios. We generated synthetic !°Be and 7°Al 
records to inform our interpretation of the AND-1B record, using the MATLAB 
implementation of refs. **°4, and including nuclide production by muons. Ina first 
experiment, we evaluated whether the 6A] measured in our record could have 
been produced before EAIS expansion at 14 Ma; we found that it could not. For 
instance, even in an extreme scenario with a high-elevation, non-eroding surface 
saturated with *°Al (namely, equal rates of nuclide production and loss by radioac- 
tive decay) at 14 Ma and subsequently decaying for 20 half-lives under cold-based, 
non-erosive ice cover, there would not be any detectable *°Al remaining today 
(Extended Data Fig. 4). 

Inasecond set of simulations, we calculated !°Be and 7°Al concentrations in sed- 
iment eroded from a bedrock profile averaged over multiple glacial cycles, driven 
by high-latitude production rates at sea level and at 2,000 m above sea level, and 
given various durations of exposure per cycle (0%-100%) and glacial erosion rates 
(0-100 m per Myr). AND-1B Pliocene !°Be concentrations are consistent with 
exposure lasting more than 10% of a glacial cycle only in cases with low-elevation 
sediment sources and very high erosion rates; most scenarios that are in agreement 
with our AND-1B data instead imply little to no exposure during the Pliocene, and 
especially during the Pleistocene (Extended Data Fig. 5). 

Finally, we tested whether a single episode of exposure for 10 thousand years 
(kyr), 50 kyr, 100 kyr or 200 kyr during the mid-Pliocene could yield a '°Be 
record similar to ours. We assumed that the bedrock had no cosmogenic nuclides 
initially, that rock was eroded at various rates (0, 20 or 100 m per Myr), and that 
the resulting sediment was then transported instantaneously to the sea floor, 
where the radionuclides were subject to decay only. The resulting time series were 
degraded to the resolution of our isotopic record. The results of these simulations 
suggest that both short-exposure, low-erosion and long-exposure, high-erosion 
scenarios could yield '°Be concentrations similar to those of our AND-1B Pliocene 
samples, but that the rapid erosion in the latter scenarios decreases !°Be concen- 
trations more quickly than we observe during the Pleistocene (Extended Data 
Fig. 6). The long mid-Pliocene exposure scenario would require either a substantial 
decrease in erosion rates from the Pliocene to the Pleistocene, or continued brief 
exposure events during the Pleistocene to match the entire AND-1B record. We 
also repeated the 200-kyr mid-Pliocene exposure simulation, but first mixed 
eroded bedrock into a deformable till layer before fluxing it to the ocean. We used 
deformable bed thicknesses of 1 m and 10 m and erosion rates of 10 m per Myr 
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and 80 m per Myr, assumed instantaneous homogenization of eroded bedrock 
material throughout the deformable bed in each time step, and removed an 
equal amount of bed material to the ocean to keep the bed thickness constant. 
This sediment mixing reduces the magnitude of the mid-Pliocene exposure 
signal and extends its longevity through time in comparison with the bedrock-only 
simulations, but it similarly fails to replicate the AND-1B record; high erosion 
rates are needed to reduce Pliocene !°Be concentrations in the regolith to AND-1B 
values, but low erosion rates are needed to have slowly discharged the regolith to 
the ocean over the past few million years (Extended Data Fig. 7). More complicated 
scenarios would be required to fit the AND-1B record, such as long-lived pockets 
of higher-!Be regolith mixing with more rapidly eroding, and thus lower-!°Be, 
bedrock. 

Data availability. All AND-1B data generated here (sediment processing, sam- 
ple and blank isotopic ratios and concentrations) are provided as Supplementary 
Information. 

Code availability. The codes used to model cosmogenic nuclide exposure and 
the erosion scenarios shown in the Extended Data Figures are available at https:// 
github.com/shakunj/Shakun-et-al-2018-Nature. 
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Extended Data Fig. 1 | Modelled patterns of erosion. a—d, Simulated The location of the AND-1B core is shown by the yellow dot. We note that 
erosion potential under the Antarctic Ice Sheet, calculated from modelled erosive zones tend to extend towards the continental interior with warming. 
driving stress and basal velocity fields for several uniform (atmosphere and _ dT, temperature anomaly from present; dV, ice-volume anomaly from 
ocean) warming scenarios of: 4 °C (a), 8 °C (b), 12 °C (c) and 15 °C (d)**. present, in sea-level equivalent (s.Le.). 
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Extended Data Fig. 2 | Glacial isostatic adjustment following ice retreat. | Model 1 is from ref. °°, and model 2 (our model) has the following 


a-d, Antarctic land above sea level (yellow) 0 kyr (a), 5 kyr (b), 10 kyr (c), parameters: lithosphere thickness, 96 km; upper-mantle viscosity, 
and 15 kyr (d) after a near-instantaneous (1-kyr) collapse of all marine- 5 x 107° Pas~1; and lower-mantle viscosity, 107? Pa s~!. The location of the 
based ice-sheet sectors, in two different models of mantle viscosity”. AND-1B core is shown by the star. 
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Extended Data Fig. 3 | Nuclide abundances in AND-1B samples versus 
blank populations. a, b, Cumulative exceedance probabilities of measured 
(that is, not blank-corrected) !°Be (a) and 7°Al (b) nuclide abundances in 
AND-1B samples (blue) and in all blanks run by the same operator in the 
same low-level fume hood (red), with 1o uncertainties. These plots display 
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the fraction of measurements that exceed a given nuclide abundance. Note 
that probabilities are generally higher for the samples than the blanks; 

in other words, a random draw from the samples is more likely to be 
above a random draw from the blanks, suggesting that they are separable 
populations. 
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Extended Data Fig. 4 | AND-1B decay-corrected 7°Al concentrations. 


Shaded intervals surrounding the blue line show 1o uncertainties, 
while shaded intervals not surrounding the blue line show the possible 
range of decay-corrected concentrations in samples that are below the 
detection limit. The dashed black line simulates the 7°Al concentration 
in non-eroding material at 2,000 metres above sea level (m asl) that was 
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originally saturated at 14 Ma and subsequently decayed under cold- 
based, non-erosive ice. The fact that several AND-1B samples have higher 
concentrations than those in this extreme scenario (which is the most 
favourable to having nuclides persist to the present) suggests that the 
AND-1B nuclides were produced after the expansion of the EAIS in the 
mid-Miocene. 
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Extended Data Fig. 5 | Modelled concentrations of cosmogenic nuclides 
for various durations of interglacial exposure and glacial erosion rates. 
a-—d, Simulated !°Be (a, b) and 7°Al (c, d) concentrations in material 
sourced from sea level and from 2,000 m asl in Antarctica as a function of 
the fraction of time for which land is exposed, during 40-kyr glacial cycles. 
(Results are nearly identical if the cycles are instead 100-kyr long.) Erosion 
rates were assumed to be 0 m per Myr during ice-free conditions, on the 
basis of geologic evidence for negligible late Cenozoic erosion in ice-free 
areas of the TAMs””°. Black arrows next to the scale bars show the range of 
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decay-corrected nuclide concentrations in AND-1B samples. The model 
was initialized with zero nuclides at 8 Ma (representative of conditions 
suggested by AND-1B sample H); the model also assumes instantaneous 
transport of eroded sediment to the ocean with no mixing, and continuous 
radioactive decay. Concentrations shown are the Pliocene (5 Ma to 3 Ma) 
average. Comparison of these simulations with AND-1B nuclide 
concentrations suggests that land exposure in sediment source regions was 
probably quite limited in duration or extent through the Plio-Pleistocene. 
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Extended Data Fig. 6 | Modelling a mid-Pliocene exposure of 
bedrock. a-d, Each panel shows actual AND-1B decay-corrected '°Be 
concentrations with 1¢ uncertainty (green), as well as simulated ‘Be 
concentrations assuming a single 10-kyr (a), 50-kyr (b), 100-kyr (c) and 
200-kyr (d) exposure of a bedrock column in the mid-Pliocene. The 
exposure event was chosen to start at 3.6 Ma and extend for up to 200 kyr 
in duration on the basis of the presence of a 60-m-thick diatomite unit 

in the AND-1B core, thought to reflect warm interglacial conditions 


from 3.6 Ma to 3.4 Ma'. Simulated records are driven by production at 

sea level (grey) or at 2,000 m asl (black), and are subjected to continuous 
radioactive decay and continuous erosion at rates of 0 m per Myr (solid 
lines), 20 m per Myr (dashed lines), and 100 m per Myr (dotted lines). The 
model assumes that the sediment source was initially devoid of nuclides 
and that sediments are transported instantaneously to the sea floor. The 
synthetic time series have been binned to the same resolution as the 
AND-1B data. 
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Extended Data Fig. 7 | Modelling a mid-Pliocene exposure event with 
eroded bedrock mixed through a deformable bed. The figure shows 
AND-1B decay-corrected '°Be concentrations with lo uncertainties 
(green). It also depicts simulated '°Be concentrations, assuming a single 
exposure event from 3.6 Ma to 3.4 Ma and routing of eroded bedrock 
through a well mixed deformable bed, for various bed thicknesses and 
erosion rates. Material eroded from the bedrock profile is instantaneously 
mixed throughout the deformable bed in each time step, and an equal 
amount of material is removed from the bed, keeping its thickness 
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constant. Sediment mixing in the deformable bed dilutes the surface ‘Be 
signal of the exposure event but extends its longevity through time in 
comparison with the bedrock simulations shown in Extended Data Fig. 6. 
Simulated records are driven by production at sea level, and subjected to 
continuous radioactive decay and continuous erosion. The model assumes 
that the bedrock and deformable bed were initially devoid of nuclides 

and that sediments eroded from the deformable bed are transported 
instantaneously to the sea floor. The synthetic time series have been 
binned to the same resolution as the AND-1B data. 
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Extended Data Fig. 8 | Conceptual diagram showing the outcomes of 
Bayesian one-group t-tests and their interpretation. a, Nuclides are 
credibly present above background: that is, the sample value is greater than 
the mean of the blanks (defined at the mode of the posterior distribution), 
and the region of uncertainty surrounding the sample value fully excludes 
the 90% credible interval (C.I.) on the posterior distribution of the mean 
of the blanks. The grey shaded regions give the uncertainty range in the 
sample nuclide concentration. b, Nuclides are not credibly present above 
background: the sample value is less than or equal to the blank mean. 

c, Nuclides are not credibly present above background: although the 
sample value is greater than the blank mean, the region of uncertainty 
surrounding the sample value does not fully exclude the 90% C.I. 
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Extended Data Table 1 | Comparison of nuclide abundances in sample populations and procedural blank populations 


Test Blanks Samples Two-sample, one- Blanks Samples Two-sample, one- 
10° atoms 10° atoms) _ Sided t-test (a=0.05) |(10* atoms) (10‘ atoms) sided t-test (a=0.05) 


7.0520.54 24.01=4.45 Frequentist: 4.1820.80 11.5941.29 Frequentist: 
n=26 n=8 p = 0.0033* n=18 n=8 p < 0.0001* 
rieiiautiat’ Rejects Hy (that mean Rejects Hy (that mean 
of the samples is less of the samples is less 
than or equal to the than or equal to the 
mean of the blanks) mean of the blanks) 


7.02=0.58 23.91=5.92 Bayesian: 3.98=0.89 11.40=1.68 Bayesian: 
n=26 n=8 99.4% probability that n=18 n=8 99.9% probability that 
sample mean is greater sample mean is greater 
than the blank mean than the blank mean 


Bayesian’ 


*The mean sample nuclide concentration is higher than the mean blank nuclide concentration at the 95% confidence level. 
tArithmetic mean and standard error of the mean of samples and of all blanks associated with the fume hood in which samples were processed by the same operator. 
+Bayesian estimation of mean and standard error of the mean of samples and of all blanks associated with the fume hood in which samples were processed by the same operator. 


We note that the mean and standard error values estimated using frequentist and Bayesian methods differ slightly, given that the former is derived arithmetically and the latter is derived through 
MCMC sampling. 
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Extended Data Table 2 | Comparison of nuclide abundances in individual samples and procedural blanks 


Sample 


Measured 
qo atoms) 


24.2122.91 
17.0629.13 
21.6823.33 
12.07=3.14 
44.67=4.45 
32.1225.05 
34.11=3.46 
6.167=1.58 


Measured 
qo* atoms) 


8.56£3.77 
18.87=5.44 
9.7723.71 
8.15+3.08 
10.1543.61 
15.1943.72 
11.2145.79 
10.79=2.92 


Corrected with blanks 
using Freq mean/SEM 
(10° atoms)’ 
17.16+2.96 
10.01+9.15 
14.6343.38 
5.0243.18 
37.6224.49 
25.07£5.08 
27.0643.51 
- 0.8841.67 


Corrected with blanks 
using Freq mean/SEM 
(10* atoms)’ 
4.392386 
14.69=5.50 
5.59£3.79 
3.97£3.18 
5.97£3.70 
11.02+3.80 
7.03£5.85 
6.6143.02 


Frequentist one- 
sample t-test 
(a=0.10)* 

p <0.10* 

p < 0.10* 

p <0.10* 

p <0.10* 

p <0.10* 

p <0.10* 

p <0.10* 

p = 0.943 


Frequentist one- 
sample t-test 
(a=0.10)* 

p <0.10* 

p <0.10* 

p <0.10* 

p <0.10* 

p <0.10* 

p <0.10* 

p <0.10* 

p < 0.10* 


Corrected with blanks 
using Bayes mean/SEM 
(10° atoms)* 
17.1942.97 
10.0449.15 
14.6643.38 
5.0543.19 
37.6524.49 
25.10£5.08 
27.0923.51 
- 0.85=1.68 


Corrected with blanks 
using Bayes mean/SEM 
(10* atoms)* 
4.59=3.88 
14.89=5.51 
5.8023.81 
4.17£3.21 
6.18+3.72 
11.22=3.82 
7.2345.86 
6.8143.05 


*The sample nuclide concentration is higher than the mean blank nuclide concentration at the 90% confidence level. 
Arithmetic average of blanks subtracted from measured atoms in each sample, with uncertainties added in quadrature. In calculating the blank average, a value of one-half the lowest detected blank 
value was substituted for the first two of the 2°Al blanks, which exhibited zero-count results. 
+The test result is significant at the specified a level where indicated by asterisks. The test compared non-corrected samples individually with the mean of the blanks. 

‘The Bayesian-estimated blank mean was subtracted from the number of measured atoms in each sample, with uncertainties added in quadrature, using the Bayesian estimated standard error of the 
mean (SEM) of the blanks. 
Values formatted as ‘a/b/c’ represent the following statistics: a, the percentage of the posterior distribution of the mean of the blanks that is above the sample value; b, the percentage of the posterior 
distribution of the mean of the blanks that is within the region of uncertainty (ROU) around the sample value (where the ROU is defined as +10); c, Y = yes and N = no in answer to the question 
“Constituent credibly present in sample?” (that is, is the 90% credibility interval of the posterior distribution of the mean of the blanks less than and fully excluding the ROU around the sample value?). 
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Rapid recovery of life at ground zero of the end- 
Cretaceous mass extinction 


Christopher M. Lowery!*, Timothy J. Bralower2, J eremy D. Owens’, Francisco J. Rodriguez-Tovar*, Heather J ones?, Jan Smit®, 
Michael T. Whalen®, Phillipe Claeys’, Kenneth Farley®, Sean P. S. Gulick!, Joanna V. Morgan’, Sophie Green!®, Elise Chenot!!, 

Gail L. Christeson!, Charles S. Cockell!2, Marco J. L. Coolen!’, Ludovic Ferriére, Catalina Gebhardt!>, Kazuhisa Goto", David A. 
Kring"’, Johanna Lofi!®, Rubén Ocampo-Torres"”, Ligia Perez-Cruz*°, Annemarie E. Pickersgill*4**, Michael H. Poelchau”’, Auriol 
S. P. Rae®, Cornelia Rasmussen!, Mario Rebolledo-Vieyra”’, Ulrich Riller, Honami Sato”, Sonia M. Tikoo”’, Naotaka Tomioka’®, 


Jaime Urrutia-Fucugauchi”, Johan Vellekoop’, Axel Wittmann”, Long Xiao*°, Kosei E. Yamaguchi*!*? & William Zylberman 


The Cretaceous/Palaeogene mass extinction eradicated 76% of 
species on Earth’”. It was caused by the impact of an asteroid** 
on the Yucatan carbonate platform in the southern Gulf of Mexico 
66 million years ago®, forming the Chicxulub impact crater”. 
After the mass extinction, the recovery of the global marine 
ecosystem—measured as primary productivity—was geographically 
heterogeneous’; export production in the Gulf of Mexico and North 
Atlantic-western Tethys was slower than in most other regions*"!!, 
taking 300 thousand years (kyr) to return to levels similar to those of 
the Late Cretaceous period. Delayed recovery of marine productivity 
closer to the crater implies an impact-related environmental control, 
such as toxic metal poisoning’, on recovery times. If no such 
geographic pattern exists, the best explanation for the observed 
heterogeneity is a combination of ecological factors—trophic 
interactions!%, species incumbency and competitive exclusion by 
opportunists!4—and ‘chance’®!*16, The question of whether the 
post-impact recovery of marine productivity was delayed closer to 
the crater has a bearing on the predictability of future patterns of 
recovery in anthropogenically perturbed ecosystems. If there is a 
relationship between the distance from the impact and the recovery 
of marine productivity, we would expect recovery rates to be slowest 
in the crater itself. Here we present a record of foraminifera, 
calcareous nannoplankton, trace fossils and elemental abundance 
data from within the Chicxulub crater, dated to approximately the 
first 200 kyr of the Palaeocene. We show that life reappeared in the 
basin just years after the impact and a high-productivity ecosystem 
was established within 30 kyr, which indicates that proximity to 
the impact did not delay recovery and that there was therefore 
no impact-related environmental control on recovery. Ecological 
processes probably controlled the recovery of productivity after the 
Cretaceous/Palaeogene mass extinction and are therefore likely to 
be important for the response of the ocean ecosystem to other rapid 
extinction events. 

The recent joint expedition of the International Ocean Discovery 
Program and International Continental Drilling Program (hereafter, 
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Expedition 364) recovered what is, to our knowledge, the first record 
of the few hundred thousand years immediately after the impact within 
the Chicxulub crater. Site M0077, which was drilled into the peak 
ring of the crater” (Extended Data Fig. 1), sampled an approximately 
130-m-thick, generally upward-fining suevite (that is, melt-bearing 
impact breccia) overlying impact melt rocks and fractured granite’’. 
The boundary between the suevite and overlying earliest-Palaeocene 
pelagic limestone is in core 40-1 (Fig. 1), and comprises a 76-cm-thick 
upward-fining, brown, fine-grained micritic limestone that we term 
the ‘transitional unit. The lower portion of the transitional unit is lam- 
inated below 54-cm core depth and contains no trace fossils (Fig. 1 
and Extended Data Fig. 2). The laminations are thin, graded beds with 
sub-millimetre-scale cross-bedding that indicates bottom currents, 
and are likely due to the movement of wave energy—including tsu- 
nami and/or seiches—in the days after the impact. The fine grain size 
(primarily clay to silt, with some sand-sized grains concentrated in the 
graded beds) suggests that much of the material in the transitional unit 
was deposited from resuspension and settling. The transitional unit is 
overlain by a white pelagic limestone. The lowermost sample taken in 
this limestone (34cm core depth) contains the planktic foraminifer 
Parvularugoglobigerina eugubina (which marks the base of Zone Pa), 
other foraminifer of the same genus (P. extensa, P. alabamensis) and 
Guembelitria cretacea. Because many other species that originate 
within Zone Pa first appear a few centimetres higher in the section 
(31-32 cm), we conclude that the base of the limestone lies very near 
the base of this zone, 30 kyr after the impact!®, 

Biostratigraphy and basic assumptions about depositional and cra- 
ter processes indicate that the transitional unit was deposited between 
several years and 30 kyr after impact (Fig. 2). To better constrain this, 
we use the abundance of extraterrestrial *He to determine sediment 
accumulation rates (see Methods). This proxy provides a firm upper 
limit of 8 kyr for deposition, assuming none of the *He is reworked. 
If even a small amount of *He is reworked (which is very likely given 
the prevalence of reworked microfossils and impact debris), then the 
transitional unit was deposited in a period of time of less than about 
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Fig. 1 | Palaeoproductivity indicators in the earliest Palaeocene at 

site M0077. The shaded area is the transitional unit and the dashed line 
represents the contact with the overlying pelagic limestone. Top to bottom: 
X-ray fluorescence-derived calcium abundance in counts per second 
(CPS); Ba/Ti and Ba/Fe ratios; percentage abundances of key planktic 
foraminiferal groups, including percentage of Guembelitria, percentage of 
survivors (that is, Cretaceous species known to survive the impact) and 
percentage of Danian taxa (that is, species that evolved after the impact) 


1 kyr, which is below the resolution of the method. With no sediment 
source other than settling of material suspended by the impact and 
subsequent tsunami and seiches, a more realistic estimate—based on 


30,000 25,000 


20,000 


15,000 


as a percentage of total foraminifera; foraminifera per gram of sediment, 
plotted on a logarithmic scale; percentage of Micula smaller than 21m 
(against total nannoplankton) and nannoplankton abundance (total 
occurrences per field of view (FOV)); percentage of benthic foraminifera 
(against total foraminifera); and core image of 364-M0077A-40R-1 

0-110 cm (616.58-617.33 m below seafloor), with discrete trace fossils 
highlighted by arrows (see Extended Data Fig. 2 for a larger version of this 
image). 


Stokes’ law—for the duration of this unit suggests about 6 years for the 
settling of a 2-1m grain of carbonate (an upper limit, as most grains are 
much larger; see Supplementary Information for further discussion). 


Years 


10,000 5,000 0 


Biostratigraphy < 30 kyr 


Fig. 2 | Constraints on the age of the transitional unit. Maximum 
durations of the transitional unit based on biostratigraphy (which suggests 
it was deposited in less than 30 kyr), extraterrestrial *He (which suggests 
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3He (any reworking) < 1 kyr i 


Stokes’ law < 6 years | 


it was deposited in approximately 8 kyr if there is no reworking, or less 
than 1 kyr if there is any reworking) and Stokes’ law, which suggests it was 
deposited in less than 6 years. 
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The lower portion of the overlying limestone, which contains fossils 
that appear approximately 30 kyr after the impact, appears conformable 
with the transitional unit and must therefore be condensed owing to 
low pelagic sedimentation in the first few tens of thousands of years 
after the impact. 

Clear, discrete trace fossils, including Planolites and Chondrites, char- 
acterize the upper 20cm of the transitional unit (above 54cm) (Fig. 1 
and Extended Data Fig. 2), providing unequivocal evidence for benthic 
life in the crater within years of the impact. Flattening of the structures 
indicates that the traces were formed while the sediment was still soft, 
during or shortly after the deposition of the transitional unit. Infilling of 
the burrows with brown, fine-grained micrite also suggests traces were 
syndepositional and not derived from mixing of the Danian limestone 
above the transitional unit. Trace fossils produced during deposition 
of the limestone, as indicated by light infilling material, are distinct and 
occur only in the uppermost few centimetres of the transitional unit 
(Extended Data Fig. 2). 

The transitional unit microfossils are dominated by clearly reworked 
Maastrichtian foraminifera and nannoplankton, known across the 
Gulf of Mexico and Caribbean as the Cretaceous/Palaeogene (K/Pg) 
boundary cocktail'? (Extended Data Fig. 3 and Supplementary Table 1). 
Although overall foraminiferal abundance (plotted as the number of 
foraminifera per gram of sedimentary rock; Fig. 1) is high at the base 
of the unit, species known to range across the boundary (‘survivor 
species’) are rare in the lower transitional unit and become more com- 
mon up-section even as total foraminifera decline (Fig. 1). Survivor 
species, here defined as G. cretacea, Muricohedbergella monmouthensis 
and Muricohedbergella holmdelensis”, dominate a depauperate assem- 
blage in the upper 20 cm of the transitional unit, coinciding with the 
first appearance of trace fossils (Extended Data Figs. 4, 5). 

The nannofossil assemblage in the transitional unit contains 
reworked Cretaceous specimens, including a group of clearly overgrown 
species (such as Aspidolithus parcus (also known as Broinsonia parca) 
and Eiffellithus eximius) that became extinct near the Campanian/ 
Maastrichtian boundary. The remainder of the Cretaceous species, 
which dominate the assemblage, range to the top of or beyond the latest 
Maastrichtian age (Supplementary Table 2). Unusually small (less than 
2m) and delicate specimens of Micula are observed throughout the 
transitional unit and increase in abundance up-section (Fig. 1), along 
with small Retecapsa (Extended Data Fig. 6). Taxa common at other 
sites of the earliest Danian stage are also present, including disaster gen- 
era (opportunistic groups that can tolerate high environmental stress) 
such as Thoracosphaera and Braarudosphaera. Unlike the foraminif- 
era, there are no clear stratigraphic trends in overall nannoplankton 
abundance (Fig. 1). 

Because survivor species lived both before and after the K/Pg mass 
extinction, it is impossible to determine for certain whether individual 
specimens in the transitional unit colonized the crater after the impact. 
However, the populations of foraminifera and nannoplankton are 
substantially different from those of the latest Cretaceous’? (that is, the 
expected population if the whole assemblage was reworked), suggesting 
that these taxa were true survivors (Fig. 1 and Extended Data Fig. 6). 
G. cretacea, a common component of the survivor assemblage in the 
upper transitional unit, was restricted to marginal marine waters during 
the Maastrichtian and would not have been present at the pre-impact 
site, which was over 100m deep”! and over 500km from shore”. The 
nannofossil assemblage in the transitional unit is considerably differ- 
ent from typical latest Maastrichtian assemblages, with some genera 
over-represented (Watznaueria and Retecapsa) and others under- 
represented (Eiffellithus, not including E. eximius, Arkhangelskiella, 
Chiastozygus and Prediscosphaera) (Extended Data Fig. 6). Additionally, 
Micula—a robust taxon often used as a proxy for dissolution— 
is not as abundant as elsewhere, indicating that these unusual 
abundances are not due to poor or selective preservation (Extended 
Data Fig. 6). 

This initial appearance of life is notably fast, especially because cra- 
ter-specific factors do not seem to have had a negative effect on the 
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Fig. 3 | Early Danian foraminifer abundances and I/(Ca + Mg) 
oxygenation proxy. Left plot, Key Danian planktic foraminifera. 

Normal perforate planktic foraminifera (Eoglobigerina, Globanomalina, 
Parasubbotina and Praemurica) are rare throughout the study interval 
and not plotted here; all are plotted as a percentage of total planktic 
foraminifera. Right plot, I/(Ca+ Mg) redox proxy, indicating well- 
oxygenated conditions in the Chicxulub crater through this interval. PFZ, 
planktic foraminfer zone. 


local recovery of life. A vigorous, high-temperature hydrothermal sys- 
tem was established within the crater and may have persisted for mil- 
lions of years after the impact”, especially across the peak ring where 
rocks exhumed from deep in the crust were extensively fractured’. 
Nevertheless, the appearance of burrowing organisms within years of 
the impact indicates that the hydrothermal system did not adversely 
affect seafloor life. Impact-generated hydrothermal systems are hypoth- 
esized to be potential habitats for early life on Earth” and on other 
planets, particularly below the surface. However, for marine impact 
craters in open ocean communication, such as Chicxulub (Extended 
Data Fig. 1), our data indicate that locally substantial but compara- 
tively small volumes of hydrothermal fluids were overwhelmed by the 
1.3 x 10*km? of well-mixed ocean water that filled the basin. 

Likewise, the open connection with the Gulf of Mexico prevented the 
development of anoxia in the crater. Our analyses of I/Ca ratios suggest 
that local dissolved oxygen was high and stable in Zone Pa (Fig. 3). 
This is in contrast to the smaller (85-km wide) Eocene Chesapeake 
Bay impact crater, where anoxia due to restriction is attributed as the 
cause of delayed recovery of the benthic ecosystem on the crater floor”. 
This comparison suggests that the establishment of life within marine 
impact craters is controlled more by circulation (and thus crater geom- 
etry) than by the magnitude of the impact or global environmental 
effects. 

The overlying pelagic limestone, which was deposited within Zone 
Pa (30-200 kyr after the impact) contains abundant evidence of 
high productivity in a thriving ecosystem. The assemblage of plank- 
tic foraminifera in Zone Pa is diverse and abundant (Fig. 3). Good 
preservation in the lowermost sample (34cm core depth) enabled the 
identification of over 60 species of benthic foraminifera, and benth- 
ics make up 12% of the total foraminiferal assemblage at this level 
(Supplementary Table 1). This percentage of benthics”° and the over- 
all benthic assemblage”’ are both typical of a palaeo-water depth of 
about 600-700 m (around the boundary between the upper and mid- 
dle bathyal zones) 10,27 At the base of the white limestone, trace fossils 
increase in size, abundance and diversity relative to the underlying 
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transitional unit. The abundance and diversity of benthic organisms 
indicate that by about 30 kyr after the impact, seafloor conditions had 
returned to normal and sufficient organic matter flux existed to sustain 
a diverse, multilayer benthic community. 

Conversely, the nannoplankton assemblage in the Danian limestone 
is dominated by Braarudosphaera and calcareous dinoflagellate cysts 
(for example, Thoracosphaera), which are common disaster taxa in the 
early recovery interval. Large, foraminifer-sized calcispheres appear 
after about 100 kyr. Calcareous phytoplankton in the earliest Danian 
clearly represent a low-diversity, high-productivity bloom. Genera such 
as Neobiscutum and Prinsius, which are common bloom taxa in the 
recovery interval at other Northern Hemisphere sites, do not become 
common until several metres higher in the section, over one million 
years after the impact. Organic microfossils are completely absent from 
the study interval, probably owing to poor preservation of organic 
material. 

Geochemical palaeoproductivity proxies, particularly Ba/Ti and Ba/ 
Fe ratios, also indicate high productivity in the post-impact Danian 
limestone (Fig. 1). Ba/Ti ratios of about 1.0 at the base of the limestone 
(approximately 30 kyr after the impact) and about 2.0 above that (15cm 
higher, or about 100 kyr after the impact) indicate relatively high and 
increasing productivity in the Chicxulub basin in the earliest Danian. 

The recovery of productivity in the crater is faster than that at many 
sites, including those in the Gulf of Mexico, some of which took 300 kyr 
or more to recover to a similar extent®"!. Therefore, we find that prox- 
imity to the impact was not a control on recovery in marine ecosystems. 
The wide range of rates of recovery in the oceans show no relationship 
with geographic distance to the crater and so are best explained by 
natural ecological interactions, such as incumbency and competitive 
exclusion, between organisms within recovery ecosystems®"*, These 
trends can be used to understand the rates of recovery after other major 
extinction events and to predict the long-term recovery of modern 
ecosystems affected by pollution and climate change. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
018-0163-6. 
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METHODS 


Sample size was determined according to standard community practice (collecting 
approximately 300 specimens per sample, when possible). No statistical methods 
were used to predetermine sample size. The experiments were not randomized 
and investigators were not blinded to allocation during experiments and outcome 
assessment. 

The IODP-ICDP Expedition 364 drilled the peak ring of the Chicxulub crater in 
the spring of 2016 (Extended Data Fig. 1). Samples were taken at the Bremen IJODP 
core repository during the Expedition 364 sampling party. Core depth in centime- 
tres—with zero at the top of the section (616.24 m below sea floor)—are reported 
throughout. Core material was indurated, and ~0.5-cm quarter-rounds were cut 
out with a rock saw. Owing to the need to reserve core material for rare earth 
element geochemistry (data not shown), the lowermost ~1.5 cm of the Danian 
limestone was not sampled. Individual samples were subdivided for foraminifera, 
calcareous nannoplankton and discrete geochemical analyses. 

Forty-three samples were examined for planktic and benthic foraminifera from 
core 40 from 0-110.cm depth. Samples were weighed, crushed with a mortar and 
pestle, soaked overnight (or longer) in a 10% solution of hydrogen peroxide buff- 
ered with borax and washed over a 43-1m sieve to ensure capture of small Danian 
taxa. The sieve was soaked in methylene blue dye between samples to identify 
contaminated specimens. Samples were then dried in an oven, split to obtain a 
manageable volume of material, and examined for foraminifera, calcispheres, and 
other sand-sized particles. In the Danian limestone, at least 300 specimens were 
counted to establish a statistically robust population” and the rest of the residue 
was then examined for biostratigraphically important taxa. Low abundances in the 
transitional unit precluded 300-specimen counts. However, we demonstrate that 
our values are sufficient to reject the null hypothesis (that the observed enrich- 
ments in survivor taxa are the result of random noise) with binomial confidence 
limits. This calculation traditionally provides the basis for the 300-specimen ‘rule’: 
counting 300 specimens provides statistical confidence at a 95% confidence interval 
that a species that makes up 1% of the population is represented in the count”®. As 
we show, fewer specimens are sufficient to demonstrate the presence of a survivor 
population in our samples. Binomial confidence limits for samples with fewer than 
300 specimens are reported in Supplementary Table 1. Additionally, a single unusu- 
ally well-preserved sample at the base of the post-impact limestone was examined 
for rare benthic species to determine the true diversity of benthic foraminifera at 
the base of the unit (Supplementary Table 1). Planktic foraminifer biozonation 
follows the P zones of Berggren and Pearson”? as modified by Wade et al.!®. 

Ninety-seven samples were examined for nannofossils. Samples were disag- 
gregated in water, and smear slides were made from the supernatant. Slides were 
observed in a transmitted light microscope at 1,600 until at least 100 specimens 
were observed (Supplementary Table 2). Standard taxonomy was applied (http:// 
www.mikrotax.org/Nannotax3/index.php?dir=Coccolithophores). The abun- 
dance of taxa at site M0077 was compared to a previous compilation of global K/ 
Pg nannoplankton”?. 

Ichnological analysis was conducted from 0-110 cm. Ichnological observations 
were conducted on core material and a detailed and continuous analysis of digital 
images. To improve visibility of ichnological features, images were treated by a 
digital image methodology, based on the modification of image adjustments as 
levels, brightness and vibrance*”*!. Ichnotaxonomical classification of trace fossils 
was based on the overall shape and the presence of diagnostic criteria such as size 
and presence of branches™. Special attention was given to the infilling material of 
biogenic structures. 

The measurement of I/(Ca + Mg) was carried out using a procedure similar 
to a previously described method*’. For each sample and geostandard, approxi- 
mately 3-4 mg of carbonate powder was weighed out, dissolved in ~0.45 M nitric 
solution and then diluted using 0.1 M nitric acid and 0.5% TMAH solution. All 
reported measurements are from samples that had a matrix of 50 + 5 p.p.m. cal- 
cium solution to ensure the most precise iodine measurement. Dissolved samples 
had TMAH solution added within an hour to avoid any possible loss of volatilized 
iodine**. Samples were measured using an Agilent inductively coupled plasma 
mass spectrometer 7500 cs housed within the geochemistry group of the National 
High Magnetic Field Laboratory at Florida State University. A previously reported 
known sample, Key Largo (KL 1-1) was used to ensure reliable reproducibility. 
Our value of 5.51 ,1mol/mol was within error of the reported value of 5.55 jzmol/ 
mol. A previous study** found that generally low oxygen conditions correspond 
to ~2.6 mol/mol for I/(Ca+ Mg). Values are reported in Supplementary Table 3. 

Section 1 of core 40 was scanned with an AVAATECH XRF Core Scanner II 
at MARUM (Bremen, Germany) during the onshore phase of Expedition 364 
(Fig. 1). The split core was covered with a 4-\1m-thick SPEX CertiPrep Ultralene 
foil to avoid contamination. X-ray fluorescence data were acquired with a Canberra 
X-PIPS silicon drift detector with 1550 eV resolution, a Canberra DAS 1000 dig- 
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ital spectrum analyser and an Oxford Instruments 50 W XTFO11 X-ray tube with 
rhodium target material. X-ray spectra were processed with WIN AXIL software 
from Canberra Erisys at a resolution of 12mm and a step of 10mm. Scans were 
conducted at different voltages to determine a range of element concentrations: 
50kV, with a beam current of 1 mA (Ba and Sr; average dead time of 5%), and 10kV 
with a beam current of 0.15 mA (Al, Si, K, Ca, Ti, Fe, Mn and S; average dead time 
of 11%). For each scan, sampling time was 20s per spot. 

3He is delivered to the Earth’s surface by cosmic dust grains and over short time 
spans (about one million years) can be used as a constant flux proxy*°. Previous 
work has shown that the K/Pg impactor was not associated with enhanced *He flux, 
and the mean extraterrestrial *He flux from cosmic dust accretion at the end of the 
Cretaceous (106 x 10-15 cc (standard temperature and pressure) per g per cm” per 
kyr) was used to estimate the duration over which the K/Pg boundary clay was 
deposited at Gubbio and El Kef*°. We use a similar approach here to establish the 
sedimentation rate of the transitional unit, which we use to develop an age model. 

Helium isotope ratios and concentrations were measured on ~1-g aliquots of 
sediment following standard analytical procedures*!. Extraterrestrial 7He con- 
centrations were computed from measured He isotopic compositions using an 
isotopic deconvolution model**. Results are shown in Extended Data Table 1. *He 
concentrations and *He/*He ratios are generally low compared to typical marine 
sediments of similar age*”*. Nevertheless, with the exception of the lowest sample 
in the transitional unit (106.5 cm), the fraction of *He attributable to an extraterres- 
trial source is high, ranging from ~0.70 to 0.96. The deepest sample has a similar 
3He concentration to other samples in the transitional unit, but ~5 times more 
‘He. This elevated ‘He probably arises from a higher concentration of terrigenous 
4He-bearing material deposited rapidly after the impact. 

We see no evidence for extraterrestrial He carried in impactor fragments, such 
as highly elevated and/or highly variable *He and *He/*He ratios. The absence of 
such a signal is consistent with either (a) the absence of impactor fragments in 
the material analysed or (b) the loss of extraterrestrial 3He from the impactor via 
heating, vaporization or fusion. Note that, unlike many tracers of the impactor 
(such as Ir), deposition of fused or vaporized impactor will leave no trace in the 
sedimentary record because once He is lost into the atmosphere, it can no longer 
be retained in sediments. 

Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 

Data availability. X-ray fluorescence data have previously been published*® and 
are available online (https://doi.org/10.14379/iodp.proc.364.2017). All other 
data supporting the findings of this study are available within the paper and 
its Supplementary Information. 
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Extended Data Fig. 1 | Location of site M0077 in the Chicxulub crater as seen using gravity data. Black dots are cenotes. Modified from Gulick et al. 7". 
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Extended Data Fig. 3 | Reworked Cretaceous foraminifera in the e, Racemiguembelina powelli, sample 364-M0077A-40R-1-W, 118-129 cm. 


transitional unit. a, Globigerinelloides sp., sample 364-M0077A-40R- f, Globotruncana bulloides, sample 364-M0077A-40R-1-W, 110-118 cm. 
1-W, 55-56 cm. b, Heterohelix sp., sample 364-M0077A-40R-1-W, g, Globotruncanita stuartiformis, sample 364-M0077A-40R-1-W, 
104-105 cm. ¢, Clast of pelagic limestone containing older Cretaceous 118-129 cm. h, Globotruncanita elevata, sample 364-M0077A-40R-1-W, 
planktic foraminifera, sample 364-M0077A-40R-1-W, 106-110.cm. d, 118-129 cm. Scale bars, 100 1m. 


Praegublerina pseudotessera, sample 364-M0077A-40R-1-W, 118-129 cm. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


Extended Data Fig. 4 | Scanning electron micrographs of planktic Parvularugoglobigerina eugubina 364-M0077A-40R-1-W, 31-32 cm. i, 
foraminifera from core 40. a, b, Examples of common reworked P. eugubina, sample 364-M0077A-40R-1-W, 31-32 cm. j, Globoconusa 
Cretaceous biserials, sample 364-M0077A-40R-1, 102-103 cm. c, daubjergensis, sample 364-M0077A-40R-1-W, 31-32 cm. k, Eoglobigerina 
Muricohedbergella monmouthensis, sample 364-M0077A-40R-1-W, eobulloides, sample 364-M0077A-40R-1-W, 29-30 cm. 1, Eoglobigerina 
102-103 cm. d, Muricohedbergella holmdelensis, sample 364-M0077A- edita, sample 364-M0077A-40R-1-W, 29-30cm. m, Praemurica taurica, 
40R-1-W, 44-45 cm. e, Guembelitria cretacea, sample 364-M0077A- sample 364-M0077A-40R-1-W, 10-11 cm. n, Chiloguembelina morsei, 
40R-1-W, 44-45 cm. f, G. cretacea, sample 364-M0077A-40R-1-W, sample 364-M0077A-40R-1-W, 10-11lcm. 


29-30cm. g, G. cretacea, sample 364-M0077A-40R-1-W, 29-30 cm. h, 
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Extended Data Fig. 5 | Small and regular-sized nannofossils in the 


j, 100-101 cm; k, 81-82 cm. 1-q, Images of regular-sized Micula spp.:1, 
transitional unit. All photographs from core 364-M0077-40R-1-W. 


44-45 cm; m, 41-42 cm; n, 51-52. cm; 0, 105-106 cm; p, 97-98 cm; q, 
Measurements in centimetres refer to depth in section 1 of core 40. a-k, 36-37 cm. s, t, Images of regular-sized Retecapsa spp.: s, 85-86 cm; t, 100- 
Images of small Micula spp.: a, 55-56 cm; b, 41-42 cm; c, 95-96 cm; d, 101 cm. r-v, Images of small Retecapsa spp.: r, 100-101 cm; u, 71-72 cm, v, 
41-42 cm; e, 90-91 cm; f, 94-95 cm; g, 91-92 cm; h, 91-92 cm; i, 45-46 cm; 100-101 cm. Scale bar, 21m. 
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Extended Data Fig. 6 | Relative abundances of major Maastrichtian unit at site M0077. These data demonstrate the unusual abundance of 
calcareous nannoplankton. Small blue squares are Maastrichtian sites Watznaueria and Retecapsa at site M0077. 
from a global compilation”; larger red squares are from the transitional 
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Extended Data Table 1 | #He data 


Sample 
KT39 
KT48 
KT59 
KT68 
KT79 
KT89 


KT99 


KT106.5 


start 


cm 


39 


48 


59 


68 


79 


89 


99 


107 


stop 


cm 


40 


49 


60 


69 


80 


90 


100 


108 


3He 

pcc/g 

0.0068 
0.0055 
0.0064 
0.0042 
0.0036 
0.0105 
0.0045 


0.0109 


4He 
ncc/g 
13.6 
35.4 
23.1 
31.6 
18.3 
34.7 
64.3 


327 


Absolute Fraction 


3He/*He 
5.04E-07 
1.56E-07 
2.78E-07 
1.33E-07 
1.99E-07 
3.04E-07 
6.99E-08 


3.32E-08 


3He ET 


0.96 


0.87 


0.92 


0.84 


0.9 


0.93 


0.70 


0.37 


Maximum ?He -Based 
Model Age (kyr) 
6.0 
4.9 
4.0 
2.9 
1.9 
0.9 
0.1 


0.0 
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Parallel emergence of stable and dynamic memory 
engrams in the hippocampus 


Thomas Hainmueller)2? & Marlene Bartos!* 


During our daily life, we depend on memories of past experiences 
to plan future behaviour. These memories are represented by the 
activity of specific neuronal groups or ‘engrams’’”. Neuronal 
engrams are assembled during learning by synaptic modification, 
and engram reactivation represents the memorized experience. 
Engrams of conscious memories are initially stored in the 
hippocampus for several days and then transferred to cortical areas”. 
In the dentate gyrus of the hippocampus, granule cells transform 
rich inputs from the entorhinal cortex into a sparse output, which 
is forwarded to the highly interconnected pyramidal cell network in 
hippocampal area CA3°. This process is thought to support pattern 
separation‘ (but see refs. »°). CA3 pyramidal neurons project to 
CA1, the hippocampal output region. Consistent with the idea of 
transient memory storage in the hippocampus, engrams in CA1 
and CA2 do not stabilize over time’~!°. Nevertheless, reactivation of 
engrams in the dentate gyrus can induce recall of artificial memories 
even after weeks”. Reconciliation of this apparent paradox will 
require recordings from dentate gyrus granule cells throughout 
learning, which has so far not been performed for more than a 
single day®*!!!?, Here, we use chronic two-photon calcium imaging 
in head-fixed mice performing a multiple-day spatial memory task 
in a virtual environment to record neuronal activity in all major 
hippocampal subfields. Whereas pyramidal neurons in CA1-CA3 
show precise and highly context-specific, but continuously changing, 
representations of the learned spatial sceneries in our behavioural 
paradigm, granule cells in the dentate gyrus have a spatial code 
that is stable over many days, with low place- or context-specificity. 
Our results suggest that synaptic weights along the hippocampal 
trisynaptic loop are constantly reassigned to support the formation 
of dynamic representations in downstream hippocampal areas 
based on a stable code provided by the dentate gyrus. 

To study hippocampal memory engrams during long-term learning, 
we designed a goal-oriented learning task for head-fixed mice. Mice 
ran on a spherical treadmill to collect soy milk rewards on a 4-m-long 
virtual linear track displayed on monitors around the animal. After at 
least 10 days of familiarization to this track (familiar context), imaging 
sessions started in which mice ran alternatingly on this familiar and 
a visually different, novel track with different reward sites (Fig. 1a, b, 
Supplementary Video 1). Animals consistently licked more often inside 
than outside reward zones on both tracks (Fig. 1d). Initially, overall 
licking and reward-related licking were lower in the novel context than 
in the familiar context (Extended Data Fig. 1c, d). These differences 
vanished with learning. On the novel track, the ratio of rewarded 
to erroneous licks increased markedly on the second training day 
(Fig. 1d), indicating that mice remembered the rewarded locations. 

To measure hippocampal neuronal activity, mice were injected 
with adeno-associated viruses designed to express the fluorescent 
calcium indicator GCaMPé6f pan-neuronally in CA1 and the dentate 
gyrus (DG) or CA3 (Fig. Ic). A chronic transcortical imaging win- 
dow was implanted above CA1 to perform two-photon imaging of 
CAI or DG neurons"? (Supplementary Video 2). Implantation did not 


impair spatial learning in a Barnes maze (Extended Data Fig. 1h, i). 
CA1 and DG neurons were imaged at depths of around 150,1m and 
around 700 .m, respectively. To image CA3, we implanted a more 
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Fig. 1 | Two-photon calcium imaging of hippocampal place cell activity 
in a virtual environment. a, Experimental schematic. b, Behaviour 
timeline (see Methods). c, Left, CA1 and DG (top) and CA2/3 (bottom) 
implantation sites. GCaMP6f (white) and td-Tomato (tdT; red) in 
parvalbumin (PV)-expressing interneurons. Dotted lines, imaging planes. 
Right, GCaMP6f and tdT fluorescence in vivo. d, Ratio between rewarded 
and non-rewarded licks in the familiar (fam, filled circles) and novel (nov, 
open circles) contexts (n = 15 mice; two-sided signed rank-sum test). 

e, Calcium traces (grey) with significant transients (red; see Methods) of 
a GC and linear-track position (blue) over time. Right, calcium activity 
over track distance of the same GC. f, Top, fraction of active (more than 
0.05 transients per s) cells among all neurons. Test for population overlap 
(x? test). Bottom, cells with place fields among active cells. g, Spatial 
information for all familiar-track-active neurons (ANOVA on ranks, 
Dunn’ test). Boxes, 25th to 75th percentiles; bars, median; whiskers, 99% 
range. NS, not significant; ***P < 0.001. Error bars denote s.e.m. For exact 
P values see Supplementary Table 1. 
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Fig. 2 | Pyramidal cells in CA1 and CA2/3 discriminate between 
contexts. a, Activity difference scores (see Methods) between novel and 
familiar contexts for all cells (two-sided signed rank-sum test: novel versus 
familiar activity). b, Mean spatial information of active cells (two-sided 
rank-sum test). c, Familiar-track place cell activity plotted for the first 
(left) and second (middle) block of familiar-context runs and for the novel 
context (right). d, Mean activity correlations of place cells within familiar 


lateral window (Fig. 1c). Data were obtained predominantly from 
CA3 (Extended Data Fig. 2d), but some CA2 cells may also have been 
included". In all cases, we used fast volumetric scanning to simul- 
taneously record about 500 neurons (see Methods, Supplementary 
Videos 3-5). 

We first analysed neuronal activity in the familiar and novel con- 
texts (Fig. 1). Consistent with previous findings'?!*’>!°, pyramidal cells 
(PYRs) in CA1-CA3 were substantially more active than granule cells 
(GCs) (Fig. 1f, Extended Data Fig. 3a, b). We also determined the fraction 
of cells that was active in the familiar, novel or both contexts with more 
than 0.05 calcium transients per second. Activation of hippocampal 
neurons might be predetermined by intrinsic properties’. In line with 
this idea, we found a marked overlap of active neuronal ensembles 
between contexts (Fig. 1f, upper row). About 35% of these active neu- 
rons had a clearly defined place field (see Methods) on the first record- 
ing day in either the novel or the familiar context, or both (Fig. 1f, 
lower row). Because many neurons were active in both contexts, we 
investigated whether active neurons were more likely to have a place 
field in both contexts. However, the familiar- and novel-context place 
cells appeared to form independent subgroups within the active cell 
population (Fig. 1f, lower row) indicating that a separate place-coding 
group or ‘engram’ might exist for each context. Further comparison of 
spatial coding properties revealed lower average spatial information 
(see Methods) per active cell (Fig. 1g) and wider place fields (Extended 
Data Fig. 3c) in GCs compared to CA1-3 PYRs. Thus, GC activity is 
sparse and has broader and less precise spatial tuning than PYR activity. 

Next, we compared neuronal activity between contexts (Fig. 2). 
Consistent with a previous study’ and their inputs from the entorhinal 
cortex'§, mean activity in GCs decreased in the novel context, whereas 
mean activity of CAl and CA2/3 PYRs increased (Fig. 2a). Similarly, 
novel-context spatial information was markedly lower in the DG, but 
higher in both CA regions (Fig. 2b). Additionally, there was a trend 
towards higher place cell numbers on the familiar track than on the 
novel track, particularly in the DG (n = 30.50 versus 16.42 place cells 
per experiment, 12 experiments, P= 0.066, paired t-test; Extended 
Data Fig. 3f). We next investigated the cause of these activity differ- 
ences between contexts. Hippocampal ~7\-aminobutyric acid (GABA) 


= om 
QO AF/F 0.35 


(left) and novel context (middle) runs and between contexts (right bars; 
ANOVA on ranks, Dunn’s test). e, PoV correlations (see Methods) for 
all experiments within familiar-context runs and between contexts (thin 
lines). Thick lines denote mean + s.e.m. (two-sided paired t-test). Boxes, 
25th to 75th percentiles; bars, median; whiskers, 99% range. *P < 0.05, 
**P < 0.01, ***P < 0.001; NS, not significant. For exact P values see 
Supplementary Table 1. 


interneurons contribute to separation of memory engrams!? and 
formation of place fields'*. We therefore analysed the activity of parval- 
bumin (PV)-expressing interneurons (PVIs; Extended Data Fig. 4), the 
most abundant subtype of interneurons in the hippocampus. PVI activ- 
ity in CA1 and the DG correlated positively with running speed!*”° 
(Extended Data Fig. 4c—h). PVIs in the DG, but not in CA1, showed 
decreased activity in the novel context (Extended Data Fig. 4i-l), 
contrasting with reports from unidentified DG interneurons!°. Thus, 
our data argue against suppression of GCs by enhanced PVI activity, 
and are instead consistent with predominant recruitment of DG PVIs 
by local GC inputs. 

To probe neuronal discrimination between contexts, we first deter- 
mined the consistency of place cell firing on the same track between 
the first and the second block of five consecutive runs. Place cell con- 
sistency in the familiar context was high in all hippocampal subfields 
(Fig. 2c, d; F_-F’). The same measure and trial-to-trial reliability were 
generally lower for novel-context runs, indicating an initially less reli- 
able representation (Fig. 2c, d; N-N’; Extended Data Fig. 3i). Next, 
we quantified place cell remapping between the familiar and novel 
contexts. Unexpectedly, activity map correlations between contexts 
were substantially higher for DG place cells than in CA1 and CA2/3 
(Fig. 2c, d; F-N). We confirmed this finding separately in two mice by 
imaging neuronal activity in CA1 and DG of the same mice (Extended 
Data Fig. 5). Thus, DG place cell activity was similar between contexts, 
whereas that of CA1-3 place cells was highly discriminative. We also 
calculated population vectors (PoVs) for both contexts from the mean 
calcium activity maps of all cells. PoVs were significantly more dissimi- 
lar between contexts as compared to independent runs within the famil- 
iar context in CA1 and CA2/3 (P=0.004, both regions, paired t-test), 
but not in the DG (P=0.051, Fig. 2e). Indeed, activity map correlations 
between contexts were markedly lower in CA1 and CA2/3 than in the 
DG, indicating stronger remapping in CA1-3. GCs might encode trav- 
elled distance and therefore show low context-selectivity. To test this 
possibility, we let mice run on a simplified linear track with striped 
walls but no further contextual information (Extended Data Fig. 6a). 
Under these conditions, GC activity and spatial information were low, 
and GCs did not show consistent place fields (Extended Data Fig. 6b, c), 
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Fig. 3 | GC place fields are highly stable across days. a, Illustrative 
example of CA1 cells imaged on subsequent days. b, Activity of familiar- 
track place cells sorted for day 1. c, As in b for novel-track place cells. 

d, Left, experimental schematic. Right, activity map correlations between 
days for all day 1 (d1) place cells (ANOVA on ranks, Dunn's test). 

e, Activity map correlations between days for familiar-context (left) and 
novel-context (right) place cells. f, Mean trial-to-trial reliability of 
novel-track place cell responses on days 1 and 2. g, Activity map 
correlations between contexts of day 1 (left) and day 2 (right) place cells. 
e-g, Two-sided rank-sum test. Boxes, 25th to 75th percentiles; bars, 
median; whiskers, 99% range. *P < 0.05, ***P < 0.001; NS, not significant. 
For exact P values see Supplementary Table 1. 


indicating that they encode the general task layout rather than mere 
distance. Thus, GCs show reliable place representations and low con- 
text discrimination, whereas CA2/3 PYRs and CA1 PYRs prominently 
encode contextual differences. 

To investigate place field stability throughout learning, we imaged the 
same cells in both contexts on two subsequent days (Fig. 3, Extended 
Data Fig. 7). Whereas GCs maintained their place field locations in 
the same context, CAl PYRs and CA2/3 PYRs displayed substantial 
remapping (Fig. 3b-d). This was characterized by lower activity map 
correlations (Fig. 3d, e) and larger shifts of the preferred firing loca- 
tion (Extended Data Fig. 7c). Despite the generally high GC place field 
stability, activity map correlations between days were lower for GCs 
encoding the novel context than for those encoding the familiar con- 
text. By contrast, hippocampal PYRs showed similarly low stability in 
both contexts (Fig. 3e). Thus, GCs have stable place fields, whereas 
place fields in other hippocampal subfields change over days. 

Place cell stability in CAl may depend on environmental 
complexity’. We therefore repeated our experiments in a virtual con- 
text without distal visual cues (‘poor’) and a highly enriched, multi- 
sensory track (‘rich’; Supplementary Video 6). Notably, the number 
of place cells was similar between all tracks, but their firing rate, 
spatial information and day-to-day stability were markedly reduced 
on the ‘poor’ track (Extended Data Fig. 8). We observed no differences 
between the ‘rich’ track and our standard contexts, indicating that 
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CA1 place cell representations are also dynamic over days in complex 
environments. 

Next, we investigated learning-induced changes in spatial coding. 
From day 1 to day 2, there was a substantial increase in the trial-to-trial 
reliability of place cells in CA1-3, but not in the DG (Fig. 3f, Extended 
Data Fig. 7d). The DG is required for context discrimination>*. 
We therefore tested whether neuronal activity became more distinct 
between contexts with learning. Unexpectedly, activity correlations 
between contexts were unchanged in GCs from day 1 to day 2 but 
were lower for CA1 place cells on day 2 and remained negative in 
CA2/3 (Fig. 3g). Thus, improved behavioural context discrimination 
was accompanied by a decorrelation of place cell activity in CA1, but 
not in the DG. 

In light of recent findings”’, we compared place coding between the 
deep and superficial sublayers of CA1. Notably, place field stability 
across days was slightly higher in deep-layer PYRs (45% difference, 
P=0.046; Extended Data Fig. 9i), albeit at generally low levels. Place 
field reliability and context discrimination were comparable between 
sublayers (Extended Data Fig. 9g, h). 

To investigate the development of spatial representations over the 
time-course of hippocampus-dependent memory”, we continued imag- 
ing sessions for five subsequent days (Fig. 4, Extended Data Fig. 10). 
Whereas the number of place cells was similar for each context and 
day (Fig. 4c, white numbers), their firing locations changed markedly 
in some hippocampal sub-areas. Familiar-context place fields of GCs 
remained stable throughout all days (Fig. 4a-c) and novel-context place 
cells showed gradually increasing stability (Fig. 4b, c, Extended Data 
Fig. 10b). By contrast, CA1 and CA2/3 activity in the same contexts 
became rapidly more dissimilar over days. For CA2/3 cells, activity 
map correlations over more than two days dropped below chance 
levels (Fig. 4d), demonstrating that these neurons constantly remap 
their place fields. 

Dynamic coding has been described in CA1”~°!, CA2” and other 
associative areas”. A gradual variation of active CA1 ensembles links 
contextual memories acquired close in time”*! and remapping of indi- 
vidual PYRs is driven by synaptic plasticity'>**. By contrast, neuronal 
ensemble activity in motor areas stabilizes throughout learning”. In 
the hippocampus, temporally stable coding of GCs may induce het- 
erosynaptic plasticity at CA3 PYR dendrites by associating their activity 
with temporally dynamic inputs from the entorhinal cortex”? or other 
CA3 PYRs”. This hypothesis would explain why CA3 ensembles can 
trigger memory recall independent of GC input even when the DG is 
required for initial task learning’””*. Our results, together with previ- 
ous findings*’°, indicate that CA3 coding can be dynamic or stable, 
potentially depending on the behaviour, proximo-distal location within 
CA3 (Extended Data Fig. 2e), virtual versus real-world navigation or 
species differences in entorhinal cortex innervation?’. When CA3 
is stable, a mechanism similar to that described above may apply at 
CA3-CAI synapses. 

Traditionally, similar memories are thought to be represented 
by largely non-overlapping populations of GCs*. However, recent 
findings indicate that GCs remap only between widely dissimilar 
environments’', while other cell types (for example, mossy cells) 
discriminate between similar contexts!*. Accordingly, CA2/3 PYR activ- 
ity was most discriminative between our virtual contexts (Fig. 2d, e). 
The high similarity of our—mostly mature’—GC activity between 
contexts may explain why mature GCs mediate generalization between 
similar contexts rather than pattern separation’. 

Our results further suggest that the hippocampus combines stable 
and dynamic coding and reunites findings of temporally varying 
neuronal ensembles encoding the same environment”* with reports of 
stable behavioural output upon DG engram reactivation over weeks”””. 
Given that the DG is required for the extinction or modification of 
existing memories acquired in the same scenery*®”°, our data support 
the hypothesis that GCs provide a simplistic but stable representation 
of the global environment!!"” that serves as a blueprint for spatially 
and contextually precise, but temporally varying, CA1-3 engrams 
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Fig. 4 | Stable coding of GCs persists throughout multiple days, 
whereas CA2/3 PYRs constantly remap over time. a, Activity maps of 

all familiar-context place cells sorted by day 3. b, Development of activity 
map correlations between subsequent days for familiar- (dark) and novel- 
context (light) place cells (numbers show n; ANOVA on ranks, Dunn's test; 
mean +s.e.m.). c, Mean activity map correlations (colour coded; Pearson's R) 
over 5 days and two contexts as indicated on the x-axis. Each row shows 
mean correlation values for cells that had a place field on the day and track 
indicated on the y-axis (white numbers show n). d, Mean activity map 


(Fig. 4e). Such an encoding scheme would enable one to associate 
memories acquired in the same global environment but still to dis- 
criminate between slightly different or temporally separate instances 
of these memories. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
018-0191-2. 
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METHODS 


Mice. All experiments involving animals were carried out according to national 
and institutional guidelines and approved by the “Tierversuchskommission of the 
Regierungsprasidium Freiburg (license no. G16/037) in accordance with national leg- 
islation. B6;129P2-Pvalb'"(4"7/] mice (PV-Cre; The Jackson laboratory) crossed 
with B6.Cg-Gt(ROSA)26Sor'"!?(CAC-td Tomato) Hze/7 mice (Ai9-reporter; The Jackson 
laboratory) were used for all experiments at an age of 7-15 weeks (n=6 for DG, 5 
for CA2/3, 11 for CA1 and 7 for the ‘poor-rich—-novel’ (Extended Data Figs. 6, 8) 
recordings). Age-matched C57/Bl6 mice injected with GFP virus were used as 
control group for Barnes maze experiments. Mice were housed on a 12-h light- 
dark cycle in groups of 2-5 mice. After the start of the post-window-implantation 
training and food restriction, mice were housed individually. No statistical methods 
were used to predetermine sample size. The experiments were not randomized and 
the investigators were not blinded to allocation during experiments and outcome 
assessment. 

Virus injections and head plate implantation. All surgical procedures were 
performed in a stereotactic apparatus (Kopf instruments) under anaesthesia 
with 1-2% isoflurane and analgesia using 0.1 mg kg~! buprenorphine. A small 
(~0.5-1mm diameter) craniotomy was made over the hippocampus and 500 nl 
AAV1.Syn.GCaMP6f.WPRE.SV4 (titre 4.65 x 10’? vg (viral genomes) per ml; 
University of Pennsylvania Vector Core) were injected into CA3 (A/P —1.7mm; 
M/L 1.9mm; D/V —1.9mm) or DG and CAI (A/P —2.0mm; M/L 2.0mm; D/V 
—2.0 and/or — 1.4mm). GFP controls were injected with equal amounts of AAV1. 
CAG.GEP (titre 3.2 x 10!” vg per ml; University of North Carolina Vector core) 
with the same coordinates. In the same surgery session, mice were implanted with 
a stainless-steel head plate (25 x 10 x 0.8mm with an 8-mm central aperture). 
The head plate was oriented horizontally for CA1 and DG imaging implantations 
and with a 20° lateral angle for CA3 imaging implantations. Mice were allowed 
to recover from surgery for at least 5 days before training sessions commenced. 
Postoperative analgesic treatment was continued with carprofen (5mg kg! body 
weight) for 3 days after surgery. 

Imaging window implantation. Cortical excavation and imaging window implan- 
tation were performed more than 10 days after the initial virus injection, according 
to published protocols'*3", A craniotomy (diameter 3mm) was made centred 
at A/P —1.5mm, M/L —1.5mm for CA1/DG imaging and A/P —1.5mm, M/L 
—2.5mm for CA3 recordings. For implantations over CA3, the head of the mouse 
was tilted by 20°, so that the implantation plane was parallel to the lateral part of 
the pyramidal cell layer. Parts of the somatosensory cortex and posterior parietal 
association cortex were gently aspirated while being irrigated with chilled saline. 
We continued aspiration until the external capsule was exposed. The outer part 
of the external capsule was then gently peeled away using fine forceps, leaving 
the inner capsule and the hippocampus itself undamaged. The imaging window 
implant consisted of a 3-mm diameter coverslip (CS-3R, Warner Instruments) 
glued to the bottom of a stainless steel cannula (3-mm diameter, 1.2-1.5-mm 
height). The window was gradually lowered into the craniotomy using forceps 
until the glass was in contact with the external capsule. The implant was then 
fixed to the skull using cyanoacrylate. Mice were allowed to recover from window 
implantation for 2-3 days. 

Barnes maze. To test for potential detrimental effects of our implantation on 
normal hippocampal function, we compared animals implanted for our imaging 
experiments with GFP-injected control mice (n= 8, each) in a Barnes maze para- 
digm. The mice were tested on three consecutive days on a 1-m diameter Barnes 
maze (Noldus) with 20 equally spaced holes at the perimeter in a sound-isolated 
chamber with distal visual cues on the walls. One session with four runs interleaved 
by approximately 30s was carried out per day. The location of the escape hole in 
relation to the visual cues was kept constant throughout experiments. The mouse 
was placed onto a starting point in a different quadrant of the maze for each trial. 
A trial ended after 2 min or when the mouse’s body had completely entered the 
escape hole. Mice were tracked using video tracking software (Ethovision, Noldus) 
and the distance travelled (m) was analysed. 

Virtual environment setup and behavioural training. Our custom virtual 
environment setup consisted of an air-supported polystyrene ball (20-cm diameter), 
similar to other published designs!**!~*°. A small metal axle was attached to the 
side of the ball to constrain the ball motion to the forward—backward direction. 
The motion of the ball was monitored with an optical sensor (G-500, Logitech) 
and translated into forward motion through the virtual environment. The forward 
gain was adjusted so that 4m of distance travelled along the circumference of the 
ball equalled one full traversal along the linear track. When the mouse had reached 
the end of the track, screens were blanked for 5-10s and the mouse was ‘teleported’ 
back to the start of the linear track. The virtual environment was displayed on four 
TFT monitors (19” screen diagonal, Dell) arranged in a hexagonal arc around the 
mouse and placed ~25 cm away from the head, thereby covering ~260° of the hori- 
zontal and ~60° of the vertical visual field of the mouse, similar to ref. **. The virtual 
environment was created and simulated using the open-source 3D rendering 
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software Blender*’. The track consisted of textured walls, floors and other 3D 
rendered objects at the tracks sides as visual cues (Extended Data Fig. 1g). Potential 
reward locations were marked with visual and acoustic cues, and 4,1 of soy milk 
was gradually dispensed through a spout in front of the mouse as long as the mouse 
waited in a rewarded location (see Supplementary Video 1). The simplified (‘poor’) 
track used for the experiments shown in Extended Data Fig. 8 was derived from the 
standard familiar context, but all rendered cues and objects were removed, except 
for a homogenous texture on the walls to both sides of the mouse. The context 
for the experiments in Extended Data Fig. 6 was essentially the same, but the first 
three reward locations were additionally removed to limit positional information 
to idiothetic cues alone. For the multisensory ‘rich’ context (Extended Data Fig. 8, 
Supplementary Video 6) a large number of 3D animated objects was added to the 
standard track. Furthermore, a brief (400-ms) 4-kHz tone was played as auditory 
background at 1 Hz repetition rate and a vanilla-scented piece of cardboard was 
placed into the behavioural setup. Finally, small cloth objects were attached to a 
foam disk (10-cm diameter), which was mounted on the axis of a stepper motor. 
The motor was controlled by an Arduino-board and programmed to move the 
objects into the range of the mouse’s whiskers according to its position on the 
virtual track (Supplementary Video 6). 

Five days after head plate implantation, mice were placed in the virtual envi- 
ronment for 10-30 min daily, with gradually increasing timespans. During this 
time, only the familiar context was available to the mice. After 4-5 days of habitu- 
ation, mice showed consistent running and reward-related licking. The mice were 
thereafter implanted with the cortical window and allowed to recover from the 
surgery. After recovery, food scheduling was initiated with a goal of 85-90% of 
the ad libitum body weight. Simultaneously, training in the virtual environment 
was re-initiated for 30-60 min daily in the familiar context until consistent reward 
licking was observed in all animals and familiarization to the context had been 
achieved for at least 10 days in total before start of the imaging sessions. 

From the first day of the imaging session, mice were introduced to a novel con- 
text, which had different visual cues and floor and wall textures, but had the same 
dimensions as the familiar context including the four marked reward locations. On 
the novel track, two of these reward sites were disabled (that is, the auditory cue 
was still given, but no reward was dispensed). Licking by the mice was monitored 
with a capacitive sensor attached to the metal lick spout. For some of the initial 
animals, no lick data were recorded. Mice alternatingly ran on the two tracks for a 
total of 15-30 runs on each track and day. The mice made 1-5 runs on one track 
and then an equal number of runs on the other. The length of these trial blocks 
was randomly varied. Imaging was performed with the same set of contexts for at 
least two (up to five) subsequent days. In many of the mice, the visible area under 
the imaging window was sufficiently large to select another imaging field of view 
that contained a different population of neurons. In these cases, we repeated the 
entire imaging experiment after the first experiment had been completed and used 
the new field of view and a different novel context (see Extended Data Fig. 5). In 
this manner, we performed a total of twelve experiments in six animals for the 
DG, seven experiments in five animals for CA3, and fifteen experiments in eleven 
animals for CA1 in the familiar—novel paradigm, as well as six experiments in three 
mice for DG imaging in the simplified environment, and seven experiments in six 
animals for the poor-normal-rich paradigm in CA1. 

In vivo two-photon calcium imaging. Imaging was performed using a resonant/ 
galvo high-speed laser scanning two-photon microscope (Neurolabware) with a 
frame rate of 30 Hz for bidirectional scanning. The microscope was equipped with 
an electrically tunable, fast z-focusing lens (optotune, Edmund Optics) to switch 
between z-planes within less than a millisecond. Images were acquired through a 
16x objective (Nikon, 0.8 N.A., 3mm WD), which was tilted at an angle of 20° for 
CA3 imaging. GCaMP6f was excited at 930 nm with a femtosecond-pulsed two 
photon laser (Mai Tai DeepSee, Spectra-Physics). We scanned three imaging planes 
(~25 1m z-spacing between planes) in rapid alternation so that each plane was 
sampled at 10 Hz. The planes spanned 300-500 xm in the x/y-direction and were 
placed so that as many principal cells as possible were depicted. An early subset of 
the CA1 imaging experiments was performed with a galvo/galvo laser scanning 
microscope (Femto 2D, Femtonics) and a Chameleon Ultra II two-photon laser 
(Coherent) using only a single plane for imaging. To block ambient light from the 
photodetectors, the animal’s head plate was attached to the bottom of an opaque 
imaging chamber before each experiment, and the chamber was fixed in the behav- 
ioural apparatus together with the animal. A ring of black foam rubber between the 
imaging chamber and the microscope objective blocked any remaining stray light. 
Histology and imaging area detection. At the end of experiments, mice were 
deeply anaesthetized using ketamine/xylazine (Sigma Aldrich) and image stacks of 
the area underneath the imaging window were acquired in vivo in the two-photon 
microscope (see Supplementary Video 2). Mice were then perfused transcardially 
with 4% paraformaldehyde in PBS. Brains were cut into 100-|1m coronal slices 
and sections containing the area underneath the imaging window were collected. 
Image stacks of GCaMPé6f and tdT fluorescence in the sections were acquired with 
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a confocal microscope (LSM 710, Zeiss) and the imaged region was re-identified by 
comparing these stacks with the ones obtained in vivo, as described above (Fig. 1c, 
Extended Data Fig. 2c). For CA2/3 imaging experiments, the location of the imaged 
areas in CA3 and CA2 was confirmed for all mice by referencing sections to an 
anatomical atlas*®. 

Imaging data processing, segmentation and data extraction. Motion correction 
of all imaging data was performed line-by-line using the SIMA software package*” 
with a 2D hidden Markov model**. When necessary, a pre-alignment step was 
performed using Matlab (version R2015b, MathWorks) built-in functionality 
(rigid transformation). Motion artefacts were estimated on either the red tdT or 
the green GCaMPé6f fluorescence channel, whichever gave the better result for a 
given dataset. If no decent motion correction could be achieved, the data were 
discarded. Next, the motion-corrected and time-averaged image of tdT for each 
run was used to align recordings from the same field of view relative to each other 
and their displacements were stored with the dataset. 

For obtaining data from principal cells, regions of interest (ROIs) were drawn 
manually around cell bodies (segmentation) in the principal cell layers of the three 
hippocampal subfields using ImageJ (NIH). Cell bodies were identified based on 
the motion-corrected, time-averaged GCaMP6f fluorescence images and re- 
inspected for each run to make sure that segmented cells were clearly visible 
throughout the experiment. We disambiguated GCs from other cell types by 
their small soma size and their location in the granule cell layer of the DG. We 
did not segment or include any neurons into this GC dataset that had unusually 
large somata or were located in the hilar region. Thereby, we made sure that our 
data represent the activity of GCs rather than hilar mossy cells or GABAergic DG 
interneurons**“”. In CA1 and CA2/3, only neurons in the pyramidal cell layers 
were segmented and included in the PYR dataset. Neurons in the pyramidal 
cell layer that co-expressed tdT were excluded from this dataset. For the dis- 
ambiguation of superficial- and deep-layer CA1 PYRs (Extended Data Fig. 9), 
we referenced our imaging planes with the 3D stacks of the entire hippocampal 
formation from the respective mice and manually selected PYR somata in regions 
that were close (~50 1m) to the oriens border for the deep and somata close to 
the radiatum border for the superficial group. PVI somata were identified in the 
red tdT channel and ROIs were drawn around their somata in stratum oriens, 
pyramidale and radiatum in CA1 and in the granule cell layer as well as the hilar 
region in the DG. 

The obtained ROIs were transformed according to the displacement between the 
mean GCaMPé6f fluorescence images as described above, and the average calcium 
signal over time was obtained from each ROIs for all runs. We restricted further 
analysis on running periods with a speed of at least 5cm s~!. We identified signifi- 
cant calcium transients, which reflect the firing of principal cells, as described**°*. In 
brief, calcium traces were corrected for slow changes in fluorescence by subtracting 
the eighth percentile value of the fluorescence-value distribution in a window 
of ~8s around each time point from the raw fluorescence trace. We obtained an 
initial estimate on baseline fluorescence and standard deviation (s.d.) by calculating 
the mean of all points of the fluorescence signal that did not exceed 3 s.d. of the 
total signal and would therefore be likely to be part of a significant transient. We 
divided the raw fluorescence trace by this value to obtain a AF/F trace. We used 
this trace to determine the parameters for transient detection that yielded a false 
positive rate (defined as the ratio of negative to positive oriented transients) <<5% 
and extracted all significant transients from the raw AF/F trace. Definitive values 
for baseline fluorescence and baseline s.d. were then calculated from all points 
of this trace that did not contain significant transients. For further analysis, all 
values of this AF/F trace that did not contain significant calcium transients were 
masked and set to zero. 

For PVIs, rigorous identification of spike or burst-related calcium transients 
was prevented by their high firing rates, which exceeds the acquisition rate of our 
imaging system and the kinetics of the calcium indicator*!~“°, However, the calcium 
signal from PVIs can still be used to approximate PVI firing rate over time!*?56, 
To obtain baseline-normalized AF/F calcium traces from PVIs, we extracted their 
raw fluorescence signals and divided them by their mean fluorescence in periods in 
which the virtual environment screens were blanked between the runs. Therefore, 
the AF/F trace of PVIs reports the change in activity rates of the cells due to exposure 
to one of the contexts. 

Activity differences and spatial information. Activity difference scores in Fig. 2a 
were calculated for each cell using the following formula: (activityfamitiar - 
ACtIVItY novel) /(ACtiVItY familiar + ACtIVILYpovel). To calculate a measure for spatial infor- 
mation (SI) content for principal cells, we adapted a traditional method of SI assess- 
ment’ to calcium imaging data. To calculate SI, the average calcium activity (mean 
AF/F) was computed for each 10-cm-wide bin along the linear track and used as 
an approximation for the neurons average firing rate in that location. SI was then 
calculated for each cell as SI= >, In%p, in which ), and p; are the average 
calcium activity and fraction of time spent in the i-th bin, respectively, \ is the 
overall calcium activity averaged over the entire linear track and N is the number 


of bins on the track (40 in our case). Therefore, the amount of spatial information 
is inferred from differences in the calcium activity and reported as bits per s. 
Place field identification. Place fields were identified according to published 
methods*!*, In brief, the mean AF/F was calculated from significant calcium 
transients for 80 position bins (each 5-cm wide) and this mean fluorescence over 
distance plot was then smoothed by averaging over the three points adjacent to 
each bin. Potential place fields were initially identified as contiguous regions of 
this AF/F over distance plot in which all of the points were greater than 25% of 
the difference between the bin with the highest AF/F value and the baseline value 
(mean of the lowest 20 out of 80 bins’ AF/F values). In addition, the candidate place 
fields had to fulfil the following criteria: (1) the potential field had to have a width 
of at least 3 bins (corresponds to 15cm running distance on the ball circumfer- 
ence); (2) the mean AF/F value inside the field had to be at least seven times the 
mean of the AF/F value outside the field; and (3) significant calcium transients 
had to be present at least 20% (for CA1-3 PYRs) and 10% (for GCs) of the time 
in which the mouse was moving in the field. Potential place fields that fulfilled 
these criteria were accepted if their P value from bootstrapping exceeded 0.05. For 
bootstrapping, the AF/F trace for each experiment was broken into segments of 
at least 50 consecutive imaging frames and randomly shuffled. This was repeated 
1,000 times for each cell. Then the place field detection procedure, as described 
above, was performed on each of the shuffled AF/F traces. The P value of the place 
field was then defined as the number of these randomly shuffled traces on which a 
place field was detected according to the outlined criteria divided by the number 
of shuffles (1,000). Overall, the criteria for place cell identification were relatively 
conservative and may underestimate the fraction of detected place cells among the 
active cells (Fig. 1f). The best parameters for place field detection were determined 
on the first obtained datasets by systematic variation and optimized to detect the 
maximum number of significant place fields. 

Place field stability, consistency and discrimination between contexts. To assess 
the similarity of a place cell’s spatial representation on different contexts or days, 
we first divided the track into 40 bins and calculated the mean AF/F value for each 
bin on the track, based on all significant calcium transients (activity map) for each 
individual cell. One of these maps was computed for each context and day in all 
cells. The stability of place field of a cell was measured as the cross-correlation of 
the mean activity maps for runs in the same context on two different days. To assess 
whether the mean place field stability of cells in a region between a given day anda 
target day was above chance (Fig. 4d; Extended Data Fig. 10c), we generated chance 
distributions by correlating each cell’s activity on the selected day with that of a 
randomly chosen cell on the target day. The consistency of place field firing was 
determined as the cross-correlation between the average activity of the first and 
the second block of five consecutive runs on the same track and session. Finally, 
the similarity between contexts was determined as the correlation of mean activity 
maps for runs in the familiar context and runs in the novel context on the same 
day (Figs. 2d, 3g, Extended Data Fig. 3g). To calculate the trial-to-trial reliability 
(Fig. 3f, Extended Data Figs. 3i, 7e), we calculated the pairwise cross-correlations 
between the calcium signals of all individual runs in one session on the same track 
and averaged the obtained values for each cell. We computed PoVs from the activity 
of all imaged cells in a session by stacking their average activities over distance on 
top of each other. PoV correlations between days or contexts were then determined 
as the cross-correlation of these cellular activities in each 10-cm bin along the linear 
track between two different contexts or days!“*, 

Place field shift and centre of mass. To assess the shift of place fields (Extended 
Data Fig. 7c), we first calculated the centre of mass (COM) for each place field 
based on the mean activity map over track distance according to the following 


N 
equation: COM = ea 


in which N is the number of bins covered by the place 
field, AF; is the fluorescence in the i-th bin of the place field and x; is the distance 
of the i-th bin from the start of the track. For all cells that had a defined place field 
on the same linear track on two consecutive days, the place field relocation distance 
was defined as the distance between the two COMs of the place fields. 

Statistics. All statistical tests are described in the corresponding figure legends. All 
comparisons were two-sided. Unless indicated otherwise, statistical comparisons 
were made between cells fulfilling the individual criteria as specified in the figure 
legend. The reported n numbers exclude missing (‘NaN’) values. All ANOVA tests 
are one-way tests. 

Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 

Code availability. Any custom written code is available upon request. 

Data availability. The data that support the findings of this study are available 
from the corresponding author upon reasonable request. 
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Extended Data Fig. 1 | Virtual environment behavioural paradigm for 
head-fixed mice. Related to Fig. 1. a, Mean number of licks per spatial 
bin (10-cm wide) for one example mouse on day 1 (left) and day 2 (right) 
on the familiar (top, grey) and novel (bottom, blue) linear tracks. Blue 
shaded areas indicate reward zones. b, Mean lick rate per bin as a function 
of distance from the next reward location for the familiar (black traces) 
and novel (blue traces) contexts. Shaded areas denote s.e.m.,n=15 
experiments. c, Mean lick rate over the entire familiar (left) or novel 
(right) track on day 1 (left) or day 2 (right). Grey lines denote individual 
experiments (m= 15) and black circles with error bars show mean +s.e.m. 
d, As in c but for mean lick rate in the reward zones only. e, Reward-related 
licking plotted for the experiments continued over 5 days. In this subset of 
the data (n = 5 experiments) no significant difference in licking between 
contexts was observed on any day (repeated-measures one-way ANOVA), 
although there was a trend towards lower lick rates in the novel context. 

f, Lick rate increase in the reward zone (as compared to licking on the 
remaining track). An increase in the fraction of reward-related licks was 


barnes maze 


observed only between day 1 and 2 (n=5 experiments). g, Screenshots of 
the familiar context and the four different novel context sceneries. One 
novel context was randomly selected for each experiment from this set and 
maintained through all days. If more than one experiment was performed 
using a given animal, a different novel context was chosen for each of the 
experiments. h, Intact spatial memory was probed in the experimental 
mice and GFP-injected controls in a Barnes maze learning paradigm (see 
Methods). Blue traces show a mouse’s trajectory in the last of four sessions 
on the first (top) and third (bottom) days of the experiment. i, Mean 

path length per session for implanted mice used for imaging experiments 
(n=8 mice; green) and GFP-injected controls (n = 8 mice; grey) on 

days 1-3 (repeated-measures one-way ANOVA). There was no significant 
difference in path length between groups (day 1: P=0.96, day 2: P= 0.806, 
day 3: P=0.915, two-sided t-test). c, d, f, Two-sided paired t-test. 
*P<0.05, **P< 0.01, ***P < 0.001, n.s. not significant. Error bars denote 
s.e.m. throughout. For exact P values see Supplementary Table 1. 
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Extended Data Fig. 2 | Imaging configuration and CA3 imaging 
locations. a, Schematic of the transcortical window implantation. A 
stainless steel cannula (3 mm diameter) with a circular coverslip attached 
to the bottom is implanted into the brain and rests on the external capsule 
on top of the hippocampus (see Methods). b, Illustrative fluorescence time 
series of an active GC, including the time-averaged GCaMP fluorescence 
(AVG, left). c, Photograph of a mouse in the virtual environment setup. 
Depicted is a mouse in the CA2/3 recording group with tilted objective 
for lateral access view (see Methods). d, Anatomical drawing of the 

dorsal hippocampus indicating the imaging planes for CA3 recordings. 
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Depicted is the location of the middle plane from a three-plane imaging 
volume of 40-80 1m. Coloured numbers indicate individual mice from 
which the data are derived; in cases for which more than one experiment 
was performed in a mouse, letters denote the locations of the respective 
experiments. e, Box plot of place field stability values over days from 
familiar-context place cells in the individual recordings shown in d. White 
numbers denote the number of place cells from each experiment that 

are represented by the boxes. Boxes, 25th to 75th percentiles; white bars, 
median; whiskers, 99% range. 
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Extended Data Fig. 3 | See next page for caption. 
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Extended Data Fig. 3 | Properties of place coding in the familiar and 
novel virtual contexts. Related to Fig. 2. a, Cumulative distribution of 
mean calcium activity, based on the area under the calcium curve, for all 
GCs (red), CA2/3 PYRs (blue) and CA1 PYRs (black) recorded in 

the familiar context. b, As in a, but for the calcium transient rate. 

c, Cumulative distribution of place field widths for all day 1 familiar- 
context place cells in the respective hippocampal areas. d, Experimental 
schematic: place field consistency was measured as the correlation of the 
average activity on the first and second blocks of five runs in the same 
context (familiar: F-F’ or novel: N-N’). Place field discrimination was 
assessed as the correlation of activity maps for different contexts (F-N). 
e, Activity over distance for cells with a place field on the novel track. Cells 
were sorted according to their peak activity in novel-track runs and are 
plotted separately for the first (left) and second (middle) blocks of runs 
on the novel track and for all runs on the familiar track (right). For the 
same plots with familiar-context place cells, see Fig. 2c. f, Number of cells 
with significant place fields (see Methods) in each experiment. Thin lines 
denote individual experiments (DG, n= 12; CA2/3, n=7; CA1, n=15 
experiments), thick lines with error bars show mean + s.e.m. 

(one-way repeated measures ANOVA). g, Median activity-map 


LETTER 


correlations between the familiar and novel contexts for experiments 
with at least 20 place cells on day 1 (circles; DG, n = 6; CA2/3, n=4; CAI, 
n=11 experiments) and their mean + s.e.m. (ANOVA, Holm-Sidak). 

h, Pearson's correlation of place-related activity for cells that had a place 
field in the familiar context (left) or novel context (right) on day 1. Left 
box of each pair shows correlation of activity maps for trial blocks on the 
same track (‘consistency’) and right bar shows correlations with trials on 
the other track (‘discriminatior’) in the same session. Correlations within 
the same context were always significantly higher than between contexts, 
except for GCs that had newly acquired a place field on the novel track. 

i, Trial-to-trial reliability of place cell responses. Place-related firing was 
more reliable in the familiar than in the novel context and higher in GCs 
than in PYRs in the familiar context. Differences between areas were not 
significant for the novel context. j, Each figure shows the run-by-run 
calcium activity (colour coded) over distance from one individual place 
cell for one session in the familiar context. Rows denote individual runs. 
h, i, Boxes, 25th to 75th percentiles; white bars, median; whiskers, 99% 
range. a—c, h, i, One-way ANOVA on ranks, Dunn’s test. *P < 0.05, 

**P < 0.01, ***P < 0.001; n.s., not significant. For exact P values see 
Supplementary Table 1. 
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Extended Data Fig. 4 | Task-related activity of PV-expressing 
interneurons in CA1 and DG. a, Illustrative, time-averaged fluorescence 
image of GCaMP6f (pseudocolour white) and tdT (red) expressed in 
PVIs. Insets below show each fluorescence channel separately for the area 
indicated with the dotted line. b, Calcium trace of a representative PVI in 
CA1 (black) and distance on the virtual linear track (blue) over time. This 
PVIis active particularly at times when the animal moves fast. c, Calcium 
activity (colour coded) of the representative cell shown in b as a function 
of linear track distance for multiple runs (rows). Graph at the bottom 
shows the average of activity over distance for this cell in the familiar 
context. Shaded areas denote s.d. The same analysis was performed for 

78 CA1 and 19 DG PVIs. d, As in c, but activity was plotted as a function 


PVI activity familiar track 


running speed (cm/s) PVI activity familiar track 


of running speed. e, f, Mean activity maps over distance for all recorded 
CA] (e) or DG (f) PVIs in the familiar and novel contexts. g, h, Same as in 
e, f, but mean activity was plotted over running speed. Activity in most DG 
PVIs is suppressed in the novel context. i, Activity of PVIs in the DG on 
the familiar (red) and novel track (light red) over running speed. Shaded 
areas denote s.d. R denotes Pearson's R. j, Mean calcium activity during 
familiar-track running plotted against novel-track activity for PVIs in the 
DG. Inset, bars denote mean + s.e.m. of running-related activity for novel 
and familiar tracks (two-sided signed rank-sum test). k, 1, As in i, j but 

for CA1 PVIs. ***P < 0.001; n.s., not significant. For exact P values see 
Supplementary Table 1. 
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Extended Data Fig. 5 | Differential stability and context discrimination 
of CA1 and DG are not due to interindividual differences. 

a, Experimental schematic. Left, fluorescence image of GCaMP6f 

(white) and tdT in PVIs (red) in a post mortem coronal brain section. 
Dotted line indicates position of the imaging window. In this mouse, 
recordings were made from CA1 PYRs and DG GCs in separate, sequential 
experiments (exemplary image planes are shown in middle images). For 
each experiment, the same familiar and a different novel context (right) 
were used. Thereby, the coding properties of PYRs and GCs in the same 
animal could be compared. b, Calcium activity over distance for CA1 
PYRs (top) and DG GCs (bottom) with place fields on the familiar track 
sorted for their peak activity on day 1 in the familiar context. Activity 

of the same cells with the same sorting on day 2 in the familiar context 
(middle) and on day 1 in the novel context (right). c, Mean cellular activity 
map correlations (colour coded: Pearson's R) over two days and contexts 

as indicated on the x-axis. Data sampled only for place cells recorded in 
the selected mouse. Each row shows mean correlation values for cells that 
had a place field on the day and track indicated on the y-axis (n denoted 

as white numbers). d, Left, correlations of activity maps in the familiar 
context between days plotted for cells that had a place field in the familiar 
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context. Cells were sampled only from measurements in this particular 
mouse. Right, activity map correlations between the familiar and novel 
contexts on day 1 for all cells that had a place field in the familiar context 
on that day. Stability over time and activity map similarity between 
contexts are significantly higher for GCs than for CA1 PYRs in the same 
mouse. e-g, As in b-d, but calculated separately on cells from another 
mouse. h, Activity map correlations between days 1 and 2 were calculated 
for familiar-context place cells and medians (circles) are displayed for 
each animal that had a minimum of 20 such place cells (DG, n=4; 
CA2/3,n=4; CA1, n=9 mice). The means + s.e.m. of these per-animal 
medians (bars) were compared statistically. i, As in h, but for activity map 
correlations of familiar-context place cells on day 1 between the familiar 
and novel contexts (DG, n= 3; CA2/3, n=3; CA1, n=9 mice). Higher 
temporal stability and higher inter-context similarity are a feature of GCs 
that is consistently observed in different mice. h, i, One-way ANOVA 
with Holm-Sidak test. Error bars denote s.e.m. d, g, Boxes, 25th to 75th 
percentiles; white bars, median; whiskers, 99% range. Two-sided rank-sum 
test. *P< 0.05, ** P< 0.01, *** P< 0.001; n.s., not significant. For exact 

P values see Supplementary Table 1. 
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Extended Data Fig. 6 | Spatial firing of DG GCs requires external 
reference cues. a, Screenshot from the simplified virtual linear track 
devoid of any visual reference cues, except for patterned walls to provide a 
visual percept of self-motion. b, Cumulative distribution of activity levels 
for all GCs imaged in the standard familiar and novel contexts, as well as 


O° 


2 08 
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the simplified version shown in a. c, Cumulative distribution of spatial 
information values in all cells with a minimum activity of 0.01 transients 
per second in the familiar, novel and self-motion-only based paradigms. 
**P < 0.01, ***P< 0.001; n.s., not significant (one-way ANOVA on ranks 
with Dunn's test). For exact P values see Supplementary Table 1. 
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Extended Data Fig. 7 | Place field remapping between days. Related 
to Fig. 3. a, Mean cellular activity map correlations (Pearson's R) over 
two days and contexts as indicated on the x-axis. Each row shows mean 
correlation values for cells (white numbers denote n) that had a place 
field on the day and track indicated on the y-axis. b, Median day-to- 
day correlation (‘stability’) of familiar-context place cell activity for all 
experiments in which at least 20 cells had a place field in the familiar 
context on either day (circles; DG, n = 6; CA2/3, n=6; CA1, n=12 
experiments). Bars denote mean + s.e.m. of these per-experiment values 
(one-way ANOVA with Holm-Sidak test). ¢, For all cells that had a place 
field in the familiar context on both days of the experiment, the centres 
of mass for the activity in these place fields was determined. The graph 


shows the cumulative distribution of the distances between these centres 
(shift) between days 1 and 2, which gives a measure of the relocation of 
place fields between days. d, Trial-by-trial correlation of place cell activity 
(reliability) in the familiar and novel contexts on days 1 and 2. N= 354, 
367, 189, 226, 242, 364, 252, 285, 1,009, 1,044, 660 and 1,074 place cells 
per group (left to right). Place cell reliability for the novel context place 
cells increases in CA1 and CA2/3 between the first and second days. 
Boxes, 25th to 75th percentiles; white bars, median; whiskers, 99% range. 
c, d, One-way ANOVA on ranks with Dunn's test. *P < 0.05, **P < 0.01, 


*** P< 0.001; n.s., not significant. For exact P values see Supplementary 
Table 1. 
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Extended Data Fig. 8 | CA1 place coding is degraded in the absence of 
visual references, but does not scale with environmental complexity. 

a, Screenshots of the three different virtual linear tracks. Left, the ‘poor 
track had patterned walls, but no other cues; middle, the ‘normal’ track 
with visual references; right, the ‘rich multisensory environment with 
many visual objects, sound, odour and tactile cues (Supplementary 

Video 6). b, Calcium activity over distance for CAl PYRs with place fields 
on the tracks depicted above, sorted for their peak activity on day 1 (right) 
and, with the same sorting, on day 2 (left). Higher activity levels and day- 
to-day stability can be observed in the ‘normal and ‘rich’ environments. 
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c, Number of cells with significant place fields (see Methods) on the first 
recording day per experiment. Thin lines denote individual experiments 
(n=7), thick lines with error bars the means + s.e.m. (repeated measures 
one-way ANOVA). d, Calcium activity levels (AUC) of the place cells 
detected in the three settings. e, As in d, but for spatial information. 

f, Activity map correlations between days for all cells that had a place 
field on the corresponding track. d-f, One-way ANOVA on ranks with 
Dunn's test. Boxes, 25th to 75th percentiles; white bars, median; whiskers, 
99% range. ***P < 0.001; n.s., not significant. For exact P values see 
Supplementary Table 1. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


deep familiar context place cells 


oriens 


cell# 


familiar. 
day4 


cell# 


radiatum. . 
4| 
0 distance (m)4 
f xe ns. 
0.5 
1 Wi superficial 
a Hi deep 
0.4 £08 
i 2 ns. ns 
§ ® 0.6 
S03 £087 T ] 2 
<x 2 mo +t SG is 
= = N © a Ss 
2 e047) TF 
© 0.2 3 i“ i“ 
Py 2 
= B02 
3 2 
© 0.1 © 
8 ie} 
A * F-F' N-N' 
sup deep 0.2 
n=2022  n=2153 


Extended Data Fig. 9 | Superficial and deep layer CA1 pyramidal cells 


differ in their task-related coding properties. a, Experimental schematic. 


Cells with somata close to the border of the stratum oriens (deep CA1 
PYRs) and those close to the border of the stratum radiatum (superficial 


CA1 PYRs) were identified in different z-planes and separated for analysis. 


Illustrative pictures to the right show time-averaged fluorescence of 
GCaMPéf (white) and td-Tomato (red) in PVIs. b, Calcium activity over 
distance for deep CA1 PYRs with a place field on the familiar track. Cells 
were sorted according to their peak activity on the familiar track (left) and 
are plotted in the same order for runs on day 1 on the novel track (middle) 
and for the runs on the familiar track on the second day (right). c, As in 

b, but for novel-context place cells on the familiar context (middle) and 
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novel context on the second day (right). d, e, As in b, c, but for superficial 
PYRs. f, Calcium activity rate for superficial and deep CA1 PYRs in the 
familiar and novel contexts. Activity rates increased significantly in the 
novel context in superficial but not in deep PYRs (two-sided signed 
rank-sum test). g, Trial-to-trial correlations of place cell activity compared 
between layers for familiar-context (left) and novel-context (right) runs. 
h, Activity map correlations between contexts for cells with a place field 
in the familiar context on day 1. i, Cumulative distribution of place cell 
activity map correlations between days 1 and 2 during familiar context 
runs. g-i, Two-sided rank-sum test. *P < 0.05, **P< 0.01; n.s., not 
significant. Boxes, 25th to 75th percentiles; white bars, median; whiskers, 
99% range. For exact P values see Supplementary Table 1. 
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A molecular rheostat adjusts auxin flux to promote 
root protophloem differentiation 


P. Marhaval, A. E. L. Bassukas2>, M. Zourelidou2, M. Kolb2?, B. Moret!, A. Fastner?, W. X. Schulze’, P. Cattaneo!, 


U. Z. Hammes??, C. Schwechheimer?* & C. S. Hardtke! 


Auxin influences plant development through several distinct 
concentration-dependent effects!. In the Arabidopsis root tip, polar 
auxin transport by PIN-FORMED (PIN) proteins creates a local 
auxin accumulation that is required for the maintenance of the stem- 
cell niche?~*. Proximally, stem-cell daughter cells divide repeatedly 
before they eventually differentiate. This developmental gradient 
is accompanied by a gradual decrease in auxin levels as cells divide, 
and subsequently by a gradual increase as the cells differentiate®. 
However, the timing of differentiation is not uniform across cell 
files. For instance, developing protophloem sieve elements (PPSEs) 
differentiate as neighbouring cells still divide. Here we show 
that PPSE differentiation involves local steepening of the post- 
meristematic auxin gradient. BREVIS RADIX (BRX) and PROTEIN 
KINASE ASSOCIATED WITH BRX (PAX) are interacting plasma- 
membrane-associated, polarly localized proteins that co-localize 
with PIN proteins at the rootward end of developing PPSEs. Both 
brx and pax mutants display impaired PPSE differentiation. Similar 
to other AGC-family kinases, PAX activates PIN-mediated auxin 
efflux, whereas BRX strongly dampens this stimulation. Efficient 
BRX plasma-membrane localization depends on PAX, but auxin 
negatively regulates BRX plasma-membrane association and 
promotes PAX activity. Thus, our data support a model in which 
BRX and PAX are elements of a molecular rheostat that modulates 
auxin flux through developing PPSEs, thereby timing PPSE 
differentiation. 

Auxin is a concentration-dependent permissive-restrictive signal 
in plant cell proliferation and differentiation—-elongation that directly 
impinges on adaptive processes and growth rates”. Local auxin accu- 
mulations are important cues for organ organization. For example, high 
auxin concentration specifies the stem-cell niche in the Arabidopsis root 
tip’. Proximally, auxin concentration decreases gradually as stem-cell 
daughters repeatedly divide before they eventually differentiate. Notably, 
differentiation is accompanied by a renewed rise in auxin levels>®. 
The underlying auxin distribution is generated by plasma-membrane- 
integral PINs, which are auxin efflux carriers with a coordinated 
asymmetric cellular localization that gives rise to directional polar 
auxin transport?~“*. In root vasculature, PINs generally localize to the 
rootward end of cells, transporting auxin towards the root tip*. PINs are 
regulated by auxin, predominantly post-translationally’°. Moreover, 
the AGC-family kinases D6 PROTEIN KINASE (D6PK) and PINOID 
(PID) activate auxin efflux through PIN phosphorylation!*-””. 

The proximo-distal auxin profile in root meristems intersects 
with differential auxin activity in the radial dimension. For example, 
developing PPSEs (Extended Data Fig. la, b) display higher auxin 
accumulation than surrounding cells° (Extended Data Fig. 2a-d) 
and differentiate, whereas neighbouring cells still remain meristem- 
atic®!3-4 (Extended Data Fig. 2e). To explore whether PPSE differen- 
tiation depends on auxin activity, we manipulated the auxin response 
by expressing a constitutively active variant of an auxin-response 
factor, MONOPTEROS (MP*)!5, under the control of PPSE-specific 


COTYLEDON VASCULAR PATTERN 2 (CVP2) promoter!*!°, 
CVP2::MP* accelerated PPSE differentiation, indicating that auxin 
responses critically determine the differentiation process (Extended 
Data Fig. 2f, g). 

How differential auxin activity is achieved in PPSEs remained 
unclear. BRX is plasma-membrane-associated, polarly localized and 
specifically expressed in developing PPSEs'*!”. In brx mutants, PPSEs 
frequently fail to differentiate'?. These cells lack the characteristic cell- 
wall changes and appear as gaps in the PPSE differentiation zone!*!®!8 
(Extended Data Fig. 2h). A similar phenotype is observed in octopus 
(ops) mutants'*!7, which are affected in a parallel genetic pathway 
required for PPSE differentiation!®. Whereas OPS localizes to the 
shootward end of PPSEs, BRX co-localizes with PINs at the rootward 
end**!*!8. Auxin negatively regulates BRX protein abundance and 
plasma-membrane association, but induces BRX transcription!®”°. 
Thus, BRX is a candidate for mediating auxin effects in PPSE differen- 
tiation. In brx PPSEs, auxin accumulation as compared to neighbouring 
cells was markedly lower and more variable than in the wild type 
(Extended Data Fig. 2i, j). Although CVP2::MP* expression in brx did 
not reduce the proportion of PPSE strands with gaps (Extended Data 
Fig. 2k), it significantly stimulated root growth (Extended Data Fig. 21) 
and reduced gap size (Extended Data Fig. 2m). Such partial rescue 
was not observed with another PPSE-specific promoter that was inac- 
tive in gap cells (Extended Data Fig. 2k—n). Moreover, impaired PPSE 
differentiation was observed after pharmacological inhibition of auxin 
biosynthesis (Extended Data Fig. 20, p), and brx protophloem defects 
were aggravated by genetic interference with auxin uptake (Extended 
Data Fig. 2q). These observations support the hypothesis that finely 
tuned auxin activity contributes to PPSE differentiation. 

BRX protein is expressed only at low levels and in few cells, compli- 
cating cell-biological and biochemical investigations of BRX in its native 
context. However, a recently established trans-differentiation assay for 
sieve element formation! (Extended Data Fig. 3a, b) enabled us to 
perform proteomics analyses in a native cell type and identify specific 
BRX interactors by immunoprecipitation (Extended Data Fig. 3c, d). 
Among them, we retrieved D6PK and several D6PK-LIKE (D6PKL) 
kinases as well as PINs, but by far the most abundant was a D6PK/ 
D6PKL-related kinase (AT2G44830)”, which we named PROTEIN 
KINASE ASSOCIATED WITH BRX (PAX). 

To examine a potential role of AGC kinases in PPSE differentiation, 
we analysed d6pk/d6pkl as well as pax mutants. D6PK/D6PKL genes 
display substantial genetic redundancy and, consistent with normal PIN 
phosphorylation in their roots!®!!°, d6épk0123 quadruple mutants had 
only a mild, possibly enhanced root-growth phenotype (Extended Data 
Fig. 4a). By contrast, pax loss-of-function mutants displayed reduced 
primary root growth (Fig. 1a), which was accompanied by PPSE differ- 
entiation defects (Fig. 1b-e). No phenotype was observed in a mutant 
of the closest PAX homologue, the uncharacterized PAX-LIKE (PAXL) 
kinase (AT5G40030)**, and pax! mutation only mildly enhanced the 
pax phenotype (Fig. 1b). A PAX-CITRINE fusion protein expressed 
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Fig. 1 | Phenotypic characterization of pax mutants. a, Root length 

of seven-day-old mutant and wild-type A. thaliana L. Heynh reference 
accession Columbia-0 (Col-0) seedlings. pax-1 and pax-2 are two 
independent PAX loss-of-function alleles. All data that are displayed 
subsequently were generated using pax-1. Box plots throughout show the 
second and third quartiles, maximum, minimum and mean (white dot). 
Statistically significant differences are indicated (two-sided Student’s 

t-test; *P < 0.0008). b, Quantification of protophloem strands with gap 
cells, 6-day-old seedlings. Statistically significant differences are indicated 
(Fisher’s exact test, two-sided, all P values < 0.001; a, significantly different 


under its native promoter complemented the PPSE differentiation 
phenotype of pax mutants (Extended Data Fig. 4b) and revealed PAX 
expression in developing protophloem, as well as weaker expression 
in the xylem axis (Fig. 2a). PAX displayed rootward cellular polarity 
(Fig. 2b) and co-localized with BRX (Fig. 2c). Exclusive expression of 
PAX-CITRINE in developing PPSEs, under the BRX promoter, fully 
rescued the pax protophloem phenotype (Extended Data Fig. 4c, d). As 
previously observed in brx, the pax PPSE differentiation defects were 
accompanied by impaired phloem sap delivery into the meristem!* 
(Extended Data Fig. 4e). In summary, pax mutants represent a (hypo- 
morphic) phenocopy of brx mutants. 

The brx phenotype was not enhanced in brx pax double mutants 
(Extended Data Fig. 4f), suggesting that brx is genetically epistatic to 
pax. In turn, the pax phenotype was not significantly enhanced by d6pk/ 
d6pkl mutations (Extended Data Fig. 4g). However, similar to DoPK 
or PID, PAX (and PAXL) activated auxin efflux when co-expressed 
with PINs in Xenopus laevis oocytes’? (Fig. 2d, Extended Data Fig. 4h). 
However, PAX was the weakest activator in this assay. Moreover, sim- 
ilar to D6PKL proteins, ADP-ribosylation factor-guanine-exchange 
factor (ARF-GEF) inhibition by brefeldin A (BFA) triggered rapid 
dissociation of PAX from the plasma membrane (Fig. 3a, Extended 
Data Fig. 4i). BRX is also BFA-sensitive!®, yet in direct comparison, 
BFA-induced BRX plasma-membrane dissociation was slower than 
for PAX (Fig. 3a, Extended Data Fig. 4j, k). Consistently, BFA treat- 
ment also triggered PPSE differentiation defects in a dosage-dependent 
manner (Extended Data Fig. 41, m). Moreover, BRX abundance, but 
not PIN abundance, was severely reduced in pax PPSEs (Fig. 3b-g). By 
contrast, PAX abundance or localization did not substantially depend 
on BRX (Extended Data Fig. 4n). In protoplasts, BRX localized evenly 
at the plasma membrane, whereas PAX accumulated in large patches”4 
(Extended Data Fig. 40). Their co-expression recruited BRX into PAX 
patches. However, a cytoplasmic PAX variant” did not disrupt the even 
plasma-membrane distribution of BRX. These results suggest that PAX 
is required for efficient BRX plasma-membrane recruitment. 

Auxin activity is systemically reduced throughout brx root meris- 
tems°. We thus sought to monitor PIN activity in brx or pax. We focused 
our analysis on the dominant PIN in developing PPSEs, PIN1 (Extended 
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to Col-0; b, significantly different to pax; c, significantly different to pax 
paxl). c, Confocal microscopy of propidium iodide (PI)-stained root 
meristems. Asterisks indicate PPSE strands and the arrowhead indicates 

a gap cell in the pax protophloem. d, Confocal microscopy of Col-0 and 
pax root meristems (ClearSee fixation with PI staining), showing both 
protophloem poles. Note PPSE cell files (white arrowheads) that start with 
meristematic cells and end with mature empty sieve elements. In one pax 
pole, PPSE differentiation is perturbed (gap cells, orange arrowhead). 

e, Expanded view, highlighting a gap cell in a pax PPSE cell file 
(arrowhead). n, number of independent biological replicates. 


Data Fig. 5a). Corroborating the PIN1-green fluorescent protein (GFP) 
results, PIN] abundance and localization in both the protophloem 
and the meristem were not affected in pax or brx mutants (Extended 
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Fig. 2 | Expression analysis of PAX protein. a, Top, Confocal microscopy 
of the PAX-CITRINE fusion protein (green fluorescence) expressed under 
its native promoter in the meristem (longitudinal plane). Bottom, optical 
cross section. Asterisks indicate PPSE cell files and arrowheads indicate 
the xylem axis. b, Detection of endogenous BRX (red) using anti-BRX 
antibody staining, or PAX—CITRINE (red) or OPS-CITRINE (green) 
using anti-GFP antibody staining, in protophloem of fixed meristems 
(squashed after fixation). Arrows point rootward. c, Simultaneous 
detection of PAX-CITRINE (green) and BRX (red) by immunostaining, 
demonstrating co-localization. d, Auxin transport assays, average retention 
of radio-labelled auxin in X. laevis oocytes expressing the indicated 
heterologous plant proteins (n = 10 per time point; error bars, s.e.m.). 
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Fig. 3 | PAX-dependence of efficient BRX plasma-membrane 
association. a, Plasma-membrane dissociation of PAX-CITRINE and 
BRX-CITRINE in response to BFA treatment. b, Simultaneous detection 
of nuclear NLS-VENUS protein (green) expressed under the control 

of PPSE-specific CVP2 promoter and BRX (red) by antibody staining. 
Asterisks in all microscopy images indicate the PPSE cell file. c, Detection 
of BRX (red) by antibody staining. d, BRX signal intensity quantification 
(anti-BRX antibody detection) in PPSEs (mean from approximately ten 


Data Fig. 5b). To survey PIN1 activity, we performed immuno- 
staining with antibodies against PIN1 phosphosites that are critical for 
PIN] activation’»”. Phosphoserine $231 (J231) signal was significantly 
reduced in pax PPSEs, whereas phosphoserine $271 (J271) was not 
affected (Extended Data Fig. 5c-e). By contrast, both phosphoserines 
were barely detectable in brx meristems (Extended Data Fig. 5c, f). 
Reduced PIN1 phosphorylation was also observed in ops (Extended 
Data Fig. 5f), suggesting that meristem-wide reduced PIN] activity is 
a secondary systemic consequence of severely disturbed PPSE differ- 
entiation, similar to other traits!®. Yet, brx or pax protophloem defects 
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Fig. 4 | Regulatory input of auxin on PAX and BRX activity. a-d, Auxin 
transport assays in X. laevis oocytes expressing the indicated heterologous 
plant proteins (n = 10 per time point; error bars, s.e.m.). e-f, Response 

of BRX (e) or PAX (f) fusion protein to 5 1M auxin (IAA) treatment 

(3h), with quantification (arbitrary units; means from approximately 

ten cells per root). The statistically significant difference is indicated, 
two-sided Student's t-test. g, Radioactive in vitro kinase assays with GST 
fusion proteins of D6PK, PAX or the PAX(S596A) and PAX(S596D) 

point mutants, with the PIN1 cytosolic loop as substrate (top), and 
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cells per root, arbitrary units). e, Simultaneous immunolocalization of 
PIN1-GFP (anti-GFP, green) expressed under its native promoter and 
BRX (anti-BRX, red) by antibody staining in Col-0 and pax. f, g, BRX 
signal intensity quantification (f, anti-BRX antibody detection) and PIN1- 
GFP signal intensity (g, anti-GFP antibody detection) in PPSEs (means 
from approximately ten cells per root, arbitrary units). d—g, Statistically 
significant differences from Col-0 are indicated, two-sided Student's t-test. 


were aggravated in the presence of a pin] mutation (Extended Data 
Fig. 5g-i). In brx pin1 double mutants, protophloem was frequently 
barely distinguishable, or even absent (approximately 20% of seedlings) 
(Extended Data Fig. 5i), underlining the importance of properly regu- 
lated auxin transport for PPSE differentiation. 

The systemic ramifications of discontinuous protophloem on meristem 
development can be considered to be a post-catastrophic scenario that 
is triggered and enhanced by repeated PPSE differentiation failure, 
and is difficult to recover from once phloem sap (and thus auxin) 
delivery is impaired'®. This complicates efforts to untangle cause and 
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corresponding loading controls (bottom). h, Auxin transport assays in 
X. laevis oocytes expressing the indicated heterologous plant proteins 
(n = 10 per time point; error bars: s.e.m.). i, Quantification of gap-cell 
frequency in PPSE strands. For PAX:: YFP-PAX(S596D), only long root 
seedlings (see j) were scored. pax alone was significantly different from 
PAX::PAX-CITRINE in pax (two-sided Fisher’s exact test, *P < 0.0001). 
PAX::YFP-PAX(S596D) was not significantly different from PAX::PAX- 
CITRINE in pax or from pax alone. j, Quantification of root phenotype 
classes in a PAX:: YFP-PAX(S596D) line (n = 50). 
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effect in the cellular action of regulators from the multicellular context. 
For example, it remained unclear whether pax mutants display PPSE 
differentiation defects because of inefficient BRX plasma membrane 
recruitment, or whether brx mutants display PPSE differentiation 
defects because of a failure to control PAX activity. To investigate 
whether BRX interaction with AGC kinases affects auxin transport, 
we tested the effect of BRX co-expression on kinase-mediated PIN acti- 
vation in oocytes. In these experiments, BRX substantially inhibited 
stimulation of auxin efflux by PAX or D6PK (Fig. 4a—c, Extended Data 
Fig. 5j, k). Because this inhibition was not observed in assays with the 
more distantly related PID (Fig. 4d), our findings suggest that BRX 
action affects a subset of related AGC kinases, and that its inhibitory 
effect is determined by kinase identity. 

The observation that BRX inhibits auxin efflux appeared particu- 
larly interesting in light of its known auxin-induced plasma-membrane 
dissociation!® (Fig. 4e, Extended Data Fig. 6a, b). By contrast, neither 
PAX abundance nor localization were affected by auxin (Fig. 4f). 
However, phosphoproteomics indicated auxin-induced phosphoryl- 
ation of phosphoserine $596 in the PAX activation loop (Extended 
Data Fig. 6c), which correlated with simultaneously increased PIN1 
phosphorylation (Extended Data Fig. 6d). In vitro, recombinant PAX 
phosphorylated PIN1 with comparably low efficiency, and S596 was 
dispensable for kinase activation (Fig. 4g, Extended Data Fig. 6e). A 
PAX(S596D) phosphomimic variant, however, was considerably more 
active than wild-type PAX and displayed increased phosphorylation 
activity towards PIN] (Fig. 4g, Extended Data Fig. 6e). Matching this 
biochemical observation, PAX(S596D) also stimulated auxin efflux 
considerably more in oocytes, to a level approximately equal to DoPK 
(Fig. 4h, Extended Data Fig. 6f). However, unlike wild-type PAX, the 
PAX(S596D) variant at best partially rescued the pax mutant (Fig. 4i, 
Extended Data Fig. 6g). Moreover, PAX(S596D) frequently triggered 
a gain-of-function phenotype of even shorter, often barely developing 
roots (Fig. 4j). Consistent with the D6éPK-like activity of PAX(S596D), 
D6PK expressed from the PAX promoter could not rescue the pax 
phenotype (Extended Data Fig. 6h, i). These findings suggest that 
fine-tuning of PAX activity is a feature of properly integrated PPSE 
development. 

In a parsimonious interpretation of our results, PAX and BRX act 
together as a molecular rheostat to modulate auxin efflux dynami- 
cally (Extended Data Fig. 7). In this scenario, PAX recruits BRX to 
the plasma membrane, which inhibits PIN-mediated auxin efflux at 
lower auxin levels. Because of this inhibition, cellular auxin increases 
until BRX eventually becomes displaced from the plasma membrane. 
Concomitantly, PAX is activated and stimulates auxin efflux. Reinforced 
through auxin-induced BRX transcription®'* (Extended Data Fig. 6j, k), 
this interplay could reach a dynamic steady-state equilibrium, which 
would impair higher local auxin activity in the multicellular context to 
properly time PPSE differentiation. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized and investigators were not blinded to allocation during 
experiments and outcome assessment. Experiments were repeated two to four 
times. All attempts at replication were successful. 

Plant materials and growth conditions. The wild-type Arabidopsis line used in 
this study was the A. thaliana L. Heynh reference accession Columbia-0 (Col-0), 
which was also the genetic background for the mutants and transgenic lines. 
For plant tissue culture, seeds were surface-sterilized, stratified for 2 days in the 
dark at 4°C, and germinated in vertically placed Petri dishes on 0.9% agar and 
0.5 x Murashige and Skoog (1/2 MS) medium (Duchefa) with 0.3% sucrose 
at 22°C under continuous light. The mutant pax-1 and pax-2 alleles (T-DNA 
insertion lines SAIL_688_B04 and GABI_274F04, respectively) were identified 
from available collections and obtained from the Nottingham Arabidopsis Stock 
Centre. The following transgenic and mutant lines have been described elsewhere: 
BRX::BRX-CITRINE™, PIN1::PIN1-GFP”°, PIN3::PIN3-GFP”, PIN7::PIN7- 
GFP?’, CVP2::NLS- VENUS", 35S::DII-NLS-VENUS and 35S::mDII-NLS- 
VENUS*, brx-2', ops-2!”, pin1-613"*, aux1-7°, d6pk, dopkl1, dépkl2, d6pkl3, as 
well as their d6pk012 triple and d6pk0123 quadruple mutants!”. Primers used for 
genotyping are summarized in Extended Data Table 1. 

Constructs and generation of transgenic lines. Transgenes for plant transforma- 
tion were created in suitable binary vectors and produced using standard molecular 
biology procedures and/or the NEBuilder HiFi DNA Assembly Reaction Protocol. 
The 35S::mDI- VENUS and 35S::DIU- VENUS lines in Col-0 and brx-2 backgrounds 
have previously been described®. For the CVP2::MP* or CLE45::MP* constructs, a 
2.6-kb genomic promoter fragment upstream of the initiation codon of the CVP2 
gene’® or a 2.0-kb genomic promoter fragment upstream of the initiation codon 
of the CLAVATA3/EMBRYO SURROUNDING REGION 45 (CLE45) gene! was 
amplified, combined with amino acids 1-794 of the MP open reading frame!° 
and introduced into the pCAMBIA1305.1 binary vector. To generate the trans- 
lational PAX::PAX-CITRINE fusion, the PAX promoter region (4.5-kb upstream 
of the ATG start codon) was amplified and cloned into pDONR P4P1R as well 
as a genomic fragment of the PAX transcript region without a STOP codon into 
pDONR 221. The entry clones together with CITRINE in pDONR P2RP3 were 
cloned into the destination vector pH7m34GW by the multisite Gateway recom- 
bination system. To create UBQ10::PAX-CITRINE, the entry clones containing the 
UBQ10 promoter in pDONR P4P1R, the PAX coding sequence without a STOP 
codon in pDONR 221, and the CITRINE coding sequence in pDONR P2RP3 
were combined into binary vector pH7m34GW. The binary constructs were intro- 
duced into Agrobacterium tumefaciens strain GV3101?“?™ and transformed into 
the pertinent Arabidopsis genotypes using the floral dip method. For recombinant 
expression of the glutathione-S-transferase N-terminally tagged fusion proteins 
GST-PAX, GST-PAX(S596D) and GST-PAX(S596A), a Gateway-compatible attB- 
flanked PCR product of PAX coding sequence was amplified from cDNA and 
cloned into the Gateway-compatible donor vector pDONR 201 (Invitrogen). The 
mutagenesis leading to PAX(S596D) phosphomimetic or PAX(S596A) phospho- 
mutant variants in the activation loop of the PAX coding sequence was achieved 
using site-directed mutation PCR on a pDONR 201 entry clone carrying the PAX 
coding sequence insert. The resulting pDONR 201 entry clones served as substrate 
to recombine the PAX coding sequences into the pDEST15 (Life Technologies) 
destination vector that was ultimately used for recombinant protein expression. 
The expression vectors pDEST15 containing GST-D6PK and GST-PINI cytosolic 
loop have previously been described'?. Primers used for cloning are summarized 
in Extended Data Table 1. 

Microscopy. To visualize reporter genes and staining signals, fluorescence for 
CITRINE (excitation 514 nm, emission 529 nm), VENUS (excitation 515 nm, 
emission 528 nm), propidium iodide (PI) (excitation 536 nm, emission 617 nm), 
Alexa Fluor 488 (excitation 498 nm, emission 520 nm) and Alexa Fluor 546 (excita- 
tion 556 nm, emission 573 nm) were detected in seedlings examined under Zeiss 
LSM 700 or 710 inverted confocal scanning microscopes. Pictures were taken 
with 20x or 40x water/oil immersion objectives. PI staining of seven-day-old 
seedlings was used for quantification of protophloem cell size. For presentation, 
composite images had to be assembled in various instances. Sequential scanning 
was used for co-localization studies to avoid any interference between fluorescence 
channels. For image analyses, ImageJ (NIH; https://rsb.info.nih.gov/ij), Zeiss Zen 
(black edition) and Imaris software were used. If necessary, images were processed 
using the ‘sharper tool for clearer visualization of cellular organization. For signal 
quantifications, all samples were analysed in the same area of the root meristem, 
and the average signal intensity per transgenic line was calculated as the mean of 
means. Statistical significance was evaluated with Student’ t test. 
Pharmacological and hormonal treatments. For treatments, 5-7-day-old seed- 
lings were either grown on, or transferred either onto solid or into liquid 1/2 MS 
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medium with or without the chemicals and incubated for the indicated time. Drugs 
and hormones used were as follows: BFA (dissolved in DMSO), IAA (dissolved in 
DMSO), t-kynurenine (dissolved in DMSO), PI (1 mg ml~! in water, diluted 1:25). 
VISUAL assay and proteomics. The VISUAL protocol! was performed as pre- 
viously described!” with subsequent BRX-CITRINE or YFP pull down. In brief, 
cotyledons of six-day-old transgenic Arabidopsis seedlings were cultured for 3 days 
in induction medium and subsequently ground in extraction buffer. Supernatants 
(4 mg of total protein extract in two technical replicates) were incubated with 
anti-GFP beads (GFP-Trap_MA, Chromotek). Beads were magnetically separated 
and protein was eluted in 2x SDS sample buffer, loaded on an SDS-PAGE gel for 
electrophoresis, and subsequently analysed by liquid chromatography with tandem 
mass spectrometry. 

Protein immunolocalization. Whole mount immunolocalization on five-day-old 
seedlings was performed as described previously’. The primary antibody dilutions 
were: 1:600 for anti-GFP mouse (Roche), 1:500 for anti-BRX rabbit (this study), 
1:100 for anti-PIN1 S1-P rabbit!’, 1:300 for anti-PIN1 S4-P rabbit'! and 1:100 for 
anti-PIN1 goat (Santa Cruz Biotechnology). The secondary antibody dilutions 
were: 1:600 for Alexa Fluor 488 anti-mouse (Molecular Probes) and 1:600 for Alexa 
Fluor 546 anti-rabbit (Molecular Probes). The anti-BRX antibody was obtained 
by custom antibody production directed against the keyhole-limpet hemocyanin 
(KLH)-conjugated BRX peptide GGSSNYGPGSYHGGC with affinity purification 
(Agrisera). 

Oocyte experiments. Auxin transport assays in X. laevis oocytes were carried 
out as described!?°", The oocytes were obtained from the animal facility of the 
Technical University of Munich, Department of Nutritional Physiology. The 
animals were kept in accordance with local guidelines and regulations. To mon- 
itor expression levels, post-assay immunoblots were performed with anti-PIN1 
sheep and anti-PIN3 sheep primary antibodies (Nottingham Arabidopsis Stock 
Centre, used at 1:5,000 dilution), and anti-GFP rabbit (custom antibody"!, 
1:2,000 dilution). The secondary antibodies were anti-sheep from donkey 
(1:5,000, Santa Cruz Biotechnology) and anti-rabbit from goat (1:10,000, Santa 
Cruz Biotechnology). 

Protoplast transformation. Protoplasts were isolated from Arabidopsis suspension- 
cultured cells (Col-0) five to seven days after subculturing by incubation of 2 g 
of cell culture with 1% Cellulase R-10 (SERVA) and 0.25% Macerozyme R-10 
(SERVA). Typically, protoplasts were transformed with 20 j1g of plasmid DNA 
using polyethylene glycol-mediated transformation and analysed after 16 to 20h 
of incubation as described”*. 

In vitro kinase assay. The in vitro kinase assay was performed using recombinant 
glutathione-S-transferase N-terminally tagged fusion proteins GST-PIN1? CL 
(cytosolic loop), GST-D6PK™, GST-BRX!8, GST-PAX, GST-PAX(S596D) and 
GST-PAX(S596A), expressed in the Escherichia coli strain BL21(DE3) and purified 
using Glutathione Sepharose 4B (GE Healthcare). The kinase reactions were 
performed by incubating the purified GST-fusion proteins for 60 min at 28°C 
in the kinase reaction buffer (25 mM Tris HCl pH 7.5, 5 mM MgCh, 0.2 mM 
EDTA, 1x cOmplete protease inhibitor cocktail (Roche)), supplemented with 
10 Ci [y-*P] ATP (370 Mbgq, specific activity 185 Tbq, Hartmann Analytics). The 
reactions were stopped by boiling the protein samples mixed with 5 x concentrated 
Laemmii buffer for 10 min. Subsequently, the protein mixtures were separated by 
SDS-PAGE. After samples had been run, the SDS-PAGE gel was vacuum-dried 
and used for autoradiography. The same gel was later rehydrated and stained with 
Commassie Brilliant Blue to serve as a loading control. 

Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 

Data availability. The datasets displayed in the current study are available from 
the corresponding authors upon reasonable request. For gel source images, see 
Supplementary Fig. 1. 
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Extended Data Fig. 1 | Overview of protophloem development. 

a, Illustration of protophloem development from the stem cell to the 
mature sieve element in the Arabidopsis root meristem. b, Illustration 
of a cross section through the stele of an Arabidopsis root meristem, 
highlighting the arrangement of the two sieve element strands and the 
xylem axis. 
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Extended Data Fig. 2 | See next page for caption. 
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Extended Data Fig. 2 | Auxin activity in developing PPSEs. a, Confocal 
microscopy of the inverse auxin activity reporter DII- VENUS and its 
negative control mDII- VENUS (yellow fluorescence) in the root meristem 
(PI staining, red) of wild-type Col-0 plants. Asterisks indicate sieve 
element cell files. b, Confocal microscopy of constitutively expressed 
DII-VENUS in developing PPSEs and neighbouring cell files. Left, PI 
cell-wall staining (red); middle, DII- VENUS fluorescence (yellow; PPSE 
nuclei marked with red circles, nuclei in neighbouring cell files with blue 
circles); right, overlay. c, As in b, for mDII- VENUS. d, Relative intensity 
of the DII- VENUS reporter and its mDII-VENUS control in the nuclei of 
Col-0 PPSEs as compared to the nuclei of directly neighbouring cells. The 
statistically significant difference between DII-VENUS and mDI-VENUS 
in the PPSE/neighbours group is indicated (two-sided Student's t-test; a, 
P=5.86 x 10~'1). e, Cumulative average cell length in different root cell 
files, starting from the respective first stem-cell daughters (cell #1) 

(n = 11 wild-type Col-0 roots). f, Number of developing PPSEs from the 
first stem-cell daughter up to the first transition zone PPSE (protophloem 
length) in seven-day-old Col-0 seedlings, and transgenic seedlings 
expressing a constitutively active derivative of the auxin response factor 
MONOPTEROS (MP*) under control of the PPSE-specific CVP2 
promoter. a, P= 3.16 x 10~°; two-sided Student's t-test. g, Cumulative 
average cell length in the developing protophloem, starting from the first 
stem-cell daughter (cell #1) (n = 23 each). Elongation occurs prematurely 
in CVP2::MP* plants. h, Confocal microscopy of a brx root meristem, 
focused on one of the sieve element strands (asterisk). Arrowheads point 
out gap cells, which fail to build up the characteristic PPSE cell wall owing 
to a failure to differentiate. i, Relative intensity of the DII- VENUS reporter 
and its mDII-VENUS control in the nuclei of Col-0 and brx PPSEs as 


compared to nuclei of cells in directly neighbouring files. Statistically 
significant differences between PPSE/neighbours and neighbour/ 
neighbour in the Col-0 and brx DII- VENUS groups are indicated (two- 
sided Student's t-test; a, P= 2.49 x 10-7; b, P= 0.026). j, Coefficient of 
variance for fluorescence traces of the DII-VENUS reporter and its mDII- 
VENUS control (left) and PI staining (right) along protophloem cell files. 
The statistically significant difference in VENUS fluorescence in the brx 
group is indicated (two-sided Student's t-test; a, P= 2.30 x 1077). 

k, Quantification of PPSE strands with gaps in roots of indicated 
genotypes. 1, Root length in seven-day-old seedlings for indicated 
genotypes. The statistically significant differences between CVP2::MP“ 
in brx and brx alone (P = 0.0017) and between CVP2::MP* in brx and 
CLE45::MP* in brx (P = 0.0052) are indicated by the character a. 

m, Distribution of gap size in protophloem strands of seven-day-old 
seedlings with gaps of indicated genotypes. The statistically significant 
differences between CVP2::MP* in brx and brx alone (P = 0.0008) 

and between CVP2::MP° in brx and CLE45::MP% in brx (P = 0.0051) 

are indicated by the character a (two-sided y? test). n, Expression of 
fluorescent NLS- VENUS reporter in PPSEs of brx mutants, driven by 
either CVP2 or CLE45 promoter. Arrowheads indicate gap cells. 

0, p, Expression of CVP2::NLS- VENUS reporter (green fluorescence) in 
PPSE cell files (asterisks) of six-day-old Col-0 root meristems (PI staining, 
white) grown in the presence of (0), or transferred for 48 h onto (p), 
increasing amounts of the auxin biosynthesis inhibitor L-kynurenine 
(L-kyn). On the higher concentration, PPSE cell files (magnified) were 
barely distinguishable. q, Confocal microscopy of seven-day-old root 
meristems (PI staining, red). Asterisks indicate sieve element cell files 
(magnified, barely distinguishable in aux1 brx). 
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Extended Data Fig. 3 | Identification of BRX interactors. a, Induction 
of BRX expression in cotyledons in the VISUAL transdifferentiation 
assay, as indicated by a BRX::GUS reporter gene. b, Visualization of 
successful tracheary element differentiation using polarized light 


microscopy. ¢, Western analysis of BRX-CITRINE fusion protein after 
immunoprecipitation. d, List of the top BRX interactors, indicating the 
number of peptides isolated as compared to controls. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


a i b 
40 a 3 Minone Mone MM both strands 
: - a | 100 
a a ° | 
Eom | B a rs a; O78 | 
€ T | xe 
5 g 
D 
5 S 
Ss M™Col-0 vs HM mutant a 
fe} peg 
4 Qa 
n=23 34 25 32 23 34 19 29 21 34 21 34 24 23 i 
d6pk = d6pki1 d6pkl2 d6pki3_ d6pkO1 d6pk012 d6pk0123 06 x - 
oS PAX::PAX-CITRINE 
1S) in pax, several lines 
e : ; f 100 
line tip delay / no % w/ tip | Phloem-mediated CFDA cotyledon to 
signal signal signal root tip transport ‘ 
ae 
a 
ColO 22 8 30 73% e 
s a 
Ww 
o 
brx 5 30 35 14% a 
= 
tl 
cl 
pax 14 12 26 53% One se Se 
5 a 8 
(2) 
50 uM BFA 50 uM BFA es 
J) gc 
Eo 
S } 
gL . I a a w 
\ ; a 
S oocytes \ —T 9 S < 
2 expressing \ : = % = 
c = Q ry 
—% (MPINt S a fc) Ns 
@ M@PIN1+PID 5 © BS 
@ MIPIN1 + D6PK = ic) x 
@ PIN1 + PAX zt 
IPIN1 + PAXL 7 
Ono 0 7.5 min, 15 30 mi 
min. fom vn. mock 1h 2h 3h mock 1h 2h 3h 60 min 
fe} YFP RFP brightfield 
YFP-PAX 
n=24 ¢ 


BRX-RFP. 
n=45 


YFP-PAX 
+ 


BRX-RFP 
n=34 


MH both strands 
Mone strand 


Bnone 
100 


YFP-PAXK/R>A 
n=58 


PPSE gaps, % 


YFP-PAXK/R>A 
+ 


BRX-RFP. 
n=47 


) = 
mock 5 uM 10 uM 


10 uM BFA 


Extended Data Fig. 4 | See next page for caption 
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Extended Data Fig. 4 | Phenotypic analysis of pax-related mutants and 
transgenic lines. a, Root length in seven-day-old seedlings for indicated 
mutants and parallel Col-0 controls. Statistically significant differences 
between Col-0 and mutants are indicated (Student’s t-test, two-sided; a, 

P < 0.02). b, Quantification of gap-cell frequency in protophloem strands 
of six-day-old seedlings. Statistically significant differences are indicated 
(two-sided Fisher’s exact test; a, pax versus Col-0; b, others versus pax; 

all P values < 0.001). c, Root length in seven-day-old seedlings for Col-0, 
pax and transgenic lines in the pax mutant background that expressed 
PAX under the control of its native promoter or the BRX promoter. The 
statistically significant difference between pax and Col-0 is indicated 
(two-sided Student's t-test; a, P = 0.00016). d, Quantification of gap-cell 
frequency in protophloem strands of six-day-old seedlings. Statistically 
significant differences are indicated (two-sided Fisher's exact test; a, 

pax versus Col-0; b, others versus pax; all P values < 0.001). e, Phloem- 
mediated translocation of carboxyfluorescein diacetate succinimidyl ester 
(CFDA) dye (green fluorescence) into the phloem-unloading zone of the 
root tip 45 min after CFDA application to the cotyledons of four-day-old 
seedlings, and corresponding classification of CFDA signal at the end of 
the experiment. f, Quantification of gap-cell frequency in protophloem 
strands of six-day-old seedlings. Statistically significant differences are 
indicated (two-sided Fisher’s exact test; a, others versus Col-0; b, Col-0 and 
pax versus brx; all P values < 0.01). g, Quantification of gap-cell frequency 
in protophloem strands of six-day-old seedlings. Statistically significant 
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differences are indicated (two-sided Fisher’s exact test; a, others versus 
Col-0, all P values < 0.001). h, Auxin transport assays performed in X. 
laevis oocytes expressing the indicated heterologous plant proteins (n = 10 
oocytes per time point; error bars, s.e.m.). i, BFA control experiments. 
Accumulation of PIN1-GFP fusion protein in BFA compartments (left), 
and comparative BFA insensitivity of OPS-GFP fusion protein (right). 

j, Dissociation of PAX-CITRINE and BRX-CITRINE fusion proteins 
from the plasma membrane in response to 5 1M BFA treatment. 

k, Quantification of PAX-—CITRINE and BRX-CITRINE fluorescence 
signal at the plasma membrane in response to 5 tM BFA treatment, 
normalized to allow direct comparison (means of approximately ten 
cells per root). 1, Confocal microscopy of six-day-old PI-stained root 
meristems grown on mock or low BFA concentration as indicated. 
Asterisks indicate PPSE cell files and arrowheads indicate gap cells. 

m, Quantification of gap-cell frequency in PPSE strands of roots shown 
in (1). Statistically significant differences are indicated (two-sided 
Fisher’s exact test; a, others versus mock, P < 0.0001). n, Expression of 
PAX-CITRINE fusion protein under its native promoter, in pax single or 
brx pax double mutants. 0, Transient expression of the indicated fusion 
proteins, alone or in combination, in Arabidopsis protoplasts. 

The PAX*/®>4 variant carries point mutations in a polybasic stretch that 
is required for plasma membrane interaction‘, The average number of 
patches per protoplast is indicated. 
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Extended Data Fig. 5 | PIN activity in the root protophloem. 

a, Confocal microscopy of indicated reporter genes (green fluorescence) 
in the root meristem (PI staining, red) of Col-0 wild-type plants (top), 

and magnification without PI background (bottom). Asterisks indicate 
sieve element cell files. b, Immunolocalization of nuclear localized NLS- 
VENUS (green) expressed under control of PPSE-specific CVP2 promoter, 
and PIN1 (red) by antibody staining. Asterisks indicate PPSE cell files. 

c, Simultaneous immunolocalization of CVP2-driven NLS- VENUS 
(green) with different anti-PIN1 antibodies that specifically detect 
phosphorylated PIN1 residues $231 (J231) or $271 (J271). 

d, Quantification of the J231 phosphosite signal intensity (means from 
approximately ten cells per root, arbitrary units). The statistically 
significant difference is indicated (two-sided Student's t-test; a, 

P=1.2 x 10-°). e, Quantification of the J271 phosphosite signal intensity 
(means from approximately ten cells per root, arbitrary units). f, Immuno- 
localization of PIN1, and the J231 and J271 PIN1 phosphosites (red) in 
brx (left) or ops (right) by antibody staining, with an OPS::BRX-CITRINE 
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or CVP2::NLS-VENUS reporter in the background for the identification 
of PPSE cell files (asterisks). g, Quantification of gap-cell frequency in 
protophloem strands of six-day-old seedlings for the indicated genotypes. 
Statistically significant differences are indicated (two-sided Fisher’s exact 
test; a, Col-0 and pin1 versus others, P < 0.0001; b, brx or pax single 
mutant versus brx pin or pax pin1 double mutants, P < 0.02). 

h, Confocal microscopy of representative six-day-old Col-0, pin1, and 
brx root meristems (PI staining, white). Asterisks indicate PPSE cell files. 
i, Different phenotypic classes occurring in brx pin1 double mutant root 
meristems (PI staining, white). PPSE cell files were frequently barely 
distinguishable or missing. j, Auxin transport assays performed in X. 
laevis oocytes expressing the indicated heterologous plant proteins (n = 10 
oocytes per time point; error bars, s.e.m.). k, Western blot analysis of the 
oocytes used in j, demonstrating that BRX expression does not interfere 
with D6PK or PIN1 expression or stability (detection of YFP-D6PK and 
PIN1 with anti-GFP and anti-PIN1 antibodies, respectively). 
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Extended Data Fig. 6 | BRX auxin response and PAX acne plant proteins (n = 10 oocytes per time point; error bars, s.e.m.). g, Polar 
a, b, Response of BRX-CITRINE fusion protein to treatment with 1 1M localization of the YFP-PAX(S596D) variant in developing PPSEs of a pax 
or 10 »M auxin (NAA), time course experiment (b) with quantification mutant. h, Quantification of gap-cell frequency in protophloem strands of 
(a, means from approximately ten cells per root, arbitrary units). seven-day-old pax mutant seedlings that express a D6PK-CITRINE fusion 
Statistically significant differences are indicated (two-sided Student’s protein under the control of the PAX promoter. The statistically significant 
t-test; a, mock versus others, P < 0.0094; b, 1 {1M versus 10 1M auxin, difference is indicated (two-sided Fisher’s exact test; a, Col-0 versus 
P < 0.0028). c, Phosphoproteomics of auxin-treated seedlings, showing pax and transgenic lines, P < 0.0001). i, Polar localization of D6PK- 
normalized abundance of a conserved phosphosite in PAX, D6PK, CITRINE fusion protein in developing PPSEs of a pax mutant. j, k, Auxin 
D6PKL1-3, and AGC1-6, with subfragments indicated in different colours. _ induction of BRX transcription in developing PPSE cell files (asterisks) 
d, Same as c, for a PIN1 phosphosite. e, Radioactive in vitro kinase assays visualized using an NLS-3 x VENUS reporter gene (j), with corresponding 
with GST fusion proteins of D6PK, PAX, or PAX(S596A) and PAX(S596D) —_ quantification of nuclear fluorescence signal (k). Statistically significant 
point mutants, with BRX or the PIN1 cytosolic loop as substrate (top) differences are indicated (one-sided Student's t-test; a, versus preceding 
and corresponding loading controls (bottom). f, Auxin transport assays time point, P < 0.0153). 


performed in X. laevis oocytes expressing the indicated heterologous 
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Extended Data Fig. 7 | Molecular rheostat model for PAX-BRX action 
in the regulation of auxin efflux. Proposed model for the cellular action 
of PAX and BRX as elements of a molecular rheostat. BRX interacts with 
PAX at the plasma membrane, where it inhibits PIN-mediated auxin efflux 
at lower auxin levels. Because of reduced PIN-mediated auxin efflux, 
cellular auxin levels increase so that, eventually, BRX becomes displaced 
from the plasma membrane. Concomitantly, PAX becomes activated and 
increasingly stimulates auxin efflux. Reinforced through auxin-induced 
BRX transcription and decreasing cellular auxin levels, BRX can return to 
the plasma membrane and again inhibit auxin efflux. This interplay would 
lead to a dynamic steady-state equilibrium that fine-tunes auxin levels 
along a cell file. 
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Extended Data Table 1 | List of oligonucelotides used in this study 
Genotyping T-DNA insertion lines 5'>3' 


LB3 Sail 

LB Gabi 08474 
LBb1.3 Salk 
brx-2_F417 
brx-2_R650 
brx-2_ski3 
pax_Sail_RP 
pax_Sail_LP 
pax_Sail_ LP_UTR 
pax_GK_LP 
pax_GK_RP 
d6pk LP 

d6pk RP 
d6pkI1 LP 
d6pkI1 RP 
d6pkl2 LP 
d6pkl2 RP 
d6pk|3 LP 
d6pkI3 RP 


pin1_SALKO47613_F 
pin1_SALKO47613_RP 


Cloning 5'>3' 
attB1_PAX 
attB2_PAX 
attBir_prPAX 
attB4_prPAX 
prcvP2 
prcvP2 
prcL—45 
prcL—45 
mpd-2 

mpd-2 
attB1_PAX 


attB2_PAX with stop codon 


PAX_S596Dmut 
PAX_S596Amut 


TAGCATCTGAATTTCATAACCAATCT CGATACAC 
ATAATAACGCTGCGGACATCTACATTTT 
ATT TTG CCG ATT TCG GAA C 
GTCAGTGTTTGCTTCCTCTCTATG 
TATTTCCTTGTCTAGGTAAGAATCC 
TGATCCATGTAGATT TCCCGGACATGAA 
ATAGTAGCGGCCTAAGCGAAG 
CTGCATAGCTAGTTGCTGGTG 
CTACTACGGAACCAAATCTTGG 
GTACTCCGAGAAATGCTTCCC 
AAATTGGAGCTGCTGATGATG 

CAA GAT CAA CCG GTT TAG GAA TCT 
GGC TCA AACTGA AAG AGA GAT ATT GC 
CATTTCCATGGAAGGAAGGTGATGAGCT 
ACTCCAGAACCATACTTCGAGGCCAT 
CTTCGCCTTTGATGATCTCTG 
AGTGACGAGAGTAGCTGCAGC 
CGGACACTAATCGGATCGGA 
GGCAAGAAAGGATGATCTAACG 
CAAAAACACCCCCAAAATTT CTCT 
AATCATCACAGCCACTGATCC 


GGGGACAAGTTTGTACAAAAAAGCAGGCTTTATGCT GGAAATGGAAAGAGTT GC 
GGGGACCACTTTGTACAAGAAAGCT GGGTT GAAAAACT CAAAGT CTAGAAATTTACC 
GGGGACTGCTTTTTTGTACAAACTTGTTCATGCGATTTTTAACACCAAACAC 
GGGGACAACTTTGTATAGAAAAGTTGTTCGGACCAATT CATT CTTCAGAC 
CGCAGAGCTTGAGTAAATCAAG 
AACACAAGACAATGAAGCCATCATTGTTGCTTCTTCTCTGCAAGTG 
CATCAAAGATCTGATTGGTCACATC 
AACACAAGACAATGAAGCCATCATTTCTGCTCTTAGGCAGACAAGA 
ATGATGGCTTCATTGTCTTGTGTT 

TGGTCACCAATTCACACGTGTTAGGTT CGGACGCGGGGTG 
GGGGACAAGTTTGTACAAAAAAGCAGGCTTAATGCT GGAAATGGAAAGAGTTG 
GGGGACCACTTTGTACAAGAAAGCT GGGTATTAGAAAAACT CAAAGTCTAGAAATTTACC 
[Phos]-GGGTTCCAACAAAGTCCATGGACCGTG 
[Phos]-GGGTTCCAACAAATGCCATGGACCGTG 
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Cancer immunotherapy based on genetically redirecting T cells 
has been used successfully to treat B cell malignancies!~*. In this 
strategy, the T cell genome is modified by integration of viral vectors 
or transposons encoding chimaeric antigen receptors (CARs) that 
direct tumour cell killing. However, this approach is often limited 
by the extent of expansion and persistence of CAR T cells*°. Here we 
report mechanistic insights from studies of a patient with chronic 
lymphocytic leukaemia treated with CAR T cells targeting the CD19 
protein. Following infusion of CAR T cells, anti-tumour activity was 
evident in the peripheral blood, lymph nodes and bone marrow; this 
activity was accompanied by complete remission. Unexpectedly, at 
the peak of the response, 94% of CAR T cells originated from a single 
clone in which lentiviral vector-mediated insertion of the CAR 
transgene disrupted the methylcytosine dioxygenase TET2 gene. 
Further analysis revealed a hypomorphic mutation in this patient's 
second TET2 allele. TET2-disrupted CAR T cells exhibited an 
epigenetic profile consistent with altered T cell differentiation and, 
at the peak of expansion, displayed a central memory phenotype. 
Experimental knockdown of TET2 recapitulated the potency- 
enhancing effect of TET2 dysfunction in this patient’s CAR T cells. 
These findings suggest that the progeny of a single CAR T cell 
induced leukaemia remission and that TET2 modification may be 
useful for improving immunotherapies. 

Here we describe an unusual case in which CAR T cell therapy was 
used to treat chronic lymphocytic leukaemia (CLL) that sheds light on 
the determinants of CAR-T cell efficacy and persistence. A seventy- 
eight-year-old man with advanced relapsed/refractory CLL (Patient-10, 
Supplementary Table 1) enrolled in a clinical trial for CD19 CAR 
T cell (CTLO19) therapy (trial no. NCT01029366). He underwent two 
adoptive transfers of autologous CTLO19 cells, approximately two 
months apart. Following the first infusion, he became persistently 
febrile and was diagnosed with cytokine release syndrome (CRS). 
Signs of CRS rapidly resolved following administration of interleukin 
(IL)-6 receptor-blocking therapy. Patient-10 continued to show pro- 
gressive leukaemia six weeks after receiving his first dose of CAR 
T cells (Fig. la-c). 

Because there was a concern that early blockade of IL-6-mediated 
signalling may have diminished the response to CAR T cell therapy, 
this patient was retreated with the remainder of his CAR T cells 70 days 
after the first dose (Supplementary Table 1). Infusions were again com- 
plicated by CRS, but this resolved without anti-IL-6 receptor-blocking 


intervention. Evaluation of the patient's bone marrow one month 
later revealed extensive infiltration of CLL (Extended Data Fig. 1), 
and computed tomography (CT) scans showed minimal improve- 
ment in extensive adenopathy. Unexpectedly, two months after the 
second infusion, the expansion of CAR T cells peaked in the periph- 
eral blood, followed by contraction (Fig. 1a). CTL019 cell outgrowth 
occurred mostly in the CD8* T cell compartment, which is typical in 
patients with CLL who respond to CAR T cell treatment (Extended 
Data Fig. 2a). Delayed CAR T cell expansion was accompanied by 
high-grade CRS and elevated circulating levels of interferon (IFN)-7, 
granulocyte-colony stimulating factor (G-CSF), IL-6, IL-8 and IL-10 
(Fig. 1b). Coincident with the onset of high fever, rapid clearance of 
CLL was observed (Fig. 1c, d). Next-generation sequencing of rear- 
rangement products at the immunoglobulin heavy chain (IGH) locus 
showed a 1-log reduction in tumour burden 51 days after the second 
infusion, with complete eradication of this tumour clone from the 
blood one month later (Supplementary Table 2). CT scans showed a 
marked improvement in mediastinal and axillary adenopathy (69% 
change; Fig. 1d). Patient-10 achieved a complete response with no evi- 
dence of CLL in his marrow (Extended Data Fig. 1, Supplementary 
Table 2) and resolution of all abnormal adenopathy six months later 
(Fig. 1d and data not shown). His most recent long-term follow-up 
evaluation (after more than 4.2 years) revealed the presence of CAR 
T cells in the peripheral blood, ongoing B cell aplasia (Extended Data 
Fig. 2b-e) and no evidence of circulating disease or marrow infiltration 
(Extended Data Fig. 1). Immune cell populations in the blood were 
normal in frequency, with no observed signs of lymphoproliferative 
abnormalities (Extended Data Fig. 2f and data not shown). The patient 
remains well in complete remission that has been sustained for more 
than five years at the time of this report. 

Deep sequencing of the T cell receptor beta repertoire indicated that 
pre-infusion CD8* CTLO19 cells and the peripheral blood CD8 T cell 
compartment one month after the second infusion were polyclonal, 
with multiple distinct TCRV6 clonotypes that were similar between the 
samples (Fig. 2a, Extended Data Fig. 3a). Approximately two months 
after the second infusion, TCRV5.1 family usage exhibited a skew- 
ing of greater than 50%, with clonal dominance occurring in CD8T 
CTLO19 cells (Fig. 2a, b). Subsequent analysis revealed that 94% of the 
CD8* CAR T cell repertoire consisted of a single clone that was not 
detected at the time of transfer or one month after the second infusion 
(Fig. 2c). The expansion of this clonal population of cells declined in 
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Fig. 1 | Evaluation of clinical responses following adoptive transfer of 
CAR T cells in a patient with CLL. a, In vivo expansion and persistence 
of CAR T cells. b, Longitudinal measurements of serum cytokines before 
and after CAR T cell infusions. ¢c, Total number of circulating B-CLL 


line with CAR T cell decay kinetics (Fig. 2d). Thus, leukaemia was 
eliminated in this patient primarily by the progeny of a single CAR 
T cell that demonstrated massive in vivo expansion (approximately 
29 population doublings in the peripheral blood). 

Longitudinal analysis of blood samples from Patient-10 revealed 
a highly abundant cell clone with an integration site in intron 9 of 
TET2, which was expanded in CAR T cells at the peak of clinical 
activity and not at earlier time points (Fig. 3a). This large degree of 
T cell clonal dominance has not been observed in more than forty 
leukaemia patients treated with CD19-directed T cells at the University 
of Pennsylvania, as determined by lentiviral integration site analysis 
(Extended Data Fig. 3b-d and data not shown). In CLL and acute 
lymphblastic leukaemia (ALL), accumulation of CAR T cells in vivo 


cells before and after CTLO19 therapy. d, Sequential CT imaging of 
chemotherapy-refractory lymphadenopathy. Red arrows indicate masses 
that were progressively reduced following the second CAR T cell infusion. 


results from the expansion of a diverse polyclonal or pauciclonal reper- 
toire within the transduced T cell population®. In Patient-10, cells bear- 
ing the TET2 integration event were present in the blood at a relative 
abundance of 14% at 4.2 years after the infusion (Fig. 3a). The clonal 
population thus contracted, with no signs of insertional oncogenesis. 
Because TET2 is a tumour suppressor gene, we are continuing to 
monitor this patient carefully. 

TET2 is a master regulator of blood cell formation. Haploinsuffciency 
or deletion of TET2 is found in normal clonal haematopoiesis’ as well as 
the initiation of lymphoma and leukaemia, including naturally arising 
and human T-lymphotropic virus type 1 (HTLV-1)-associated 
malignancies*"'!. Although TET2 inactivation may contribute to 
increased self-renewal of haematopoietic stem and progenitor cells, its 
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Fig. 2 | Analysis of CAR T cell clonal expansion in a patient with 

CLL who had a delayed therapeutic response. a, Frequency of TCRVB 
gene segment usage in the blood of Patient-10 one month (left) and 

two months (middle) after the second CAR T cell infusion. TCRVB 
clonotype frequencies in sorted CAR T cells at the peak of expansion are 
also shown (right). b, Flow cytometric proportions of TCRV5.1- versus 
TCRV613.1-positive (negative control) CAR T cells. c, Abundance of 
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different TCRV85.1 clones in CAR* T cells at the peak of activity relative 
to other time points. Red dots indicate the dominant TCRV85.1 clone 
(representative of two independent experiments). d, Expansion kinetics 
of the TCRVB5.1+ dominant clone following a second infusion of CAR 

T cells plotted in parallel with CTLO19 levels. Percentages of positive cells 
were calculated from the results of a quantitative PCR assay designed to 
amplify clonotype-specific sequences. 
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Fig. 3 | Investigation of CAR lentiviral integration sites and TET2 
dysfunction in Patient-10. a, Longitudinal CAR T cell clonal abundance 
as marked by integration sites. Different colours (horizontal bars) indicate 
major cell clones. A key to the sites, named for the nearest gene, is shown 
below the graph (abundances below 3% binned as ‘Low abundance’). 

b, Diagram of the vector at the TET2 integration site locus illustrating 
splicing into the vector provirus to yield truncated transcripts. Asterisks 
denote ectopic stop codons. LTR, long terminal repeat; cPPT, polypurine 
tract; EFla, elongation factor 1-« promoter. c, Domain organization of 
TET2 and location of a residue mutated in CAR* and CAR T cells from 
Patient-10. Cysteine (Cys)-rich and catalytic double-stranded 3-helix 


disruption alone infrequently leads to overt oncogenesis!®'”. Analysis 
of polyadenylated TET2 RNA populations in clonal CAR T cells at the 
peak of expansion in Patient-10 showed the appearance of new chimae- 
ric RNAs that spliced from TET2 exon 9 into the vector and terminated, 
truncating the encoded protein with premature stop codons (Fig. 3b, 
Extended Data Fig. 4a, b). Although it is possible that expression of the 
truncated fusion TET2 protein may have a dominant-negative effect, it 
has been demonstrated that other TET2 mutant proteins do not exhibit 
such characteristics’. 

We next sequenced CAR* and CAR‘ T cells from this subject and 
examined genes involved in haematologic malignancies (Extended 
Data Fig. 4c). In both samples, a missense variant in TET2 encoding 
amino acid 1879 was found in the catalytic domain, converting the 
wild-type residue, glutamic acid, to glutamine (Fig. 3c and data not 
shown). The c.5635C mutation was present in the allele of TET2 
without the integrated CAR transgene, as that allele contained the 
wild-type reference sequence (c.5635G; Extended Data Fig. 4d). 


(DSBH) domains are also shown (red arrow denotes an amino acid 
change resulting from a missense mutation). Representative results of next 
generation sequencing appear beneath the structural diagrams. Each grey 
bar denotes a DNA fragment. A single-base G/C nucleotide substitution 

is highlighted by dashed lines above the consensus sequence. d, Diagram 
of the TET2-catalysed sequential oxidations of 5-mC to 5-hmC and to 
5-fC and 5-caC (top). Genomic levels of 5-mC, 5-hmC, 5-fC, and 5-caC 
modifications produced by the E1879Q TET2 mutant shown as per cent of 
total cytosine modifications. Percentages derived from the mean of three 
independent experiments are shown. P values were determined using a 
two-tailed, paired Student's f-test. 


No other mutations were found in the panel of 67 additional genes 
analysed. 

TET2 encodes methylcytosine dioxygenase, an enzyme that 
catalyses the conversion of 5-methylcytosine (5-mC) into 
5-hydroxymethylcytosine (5-hmC) and the rarer bases 5-formylcytosine 
(5-fC) and 5-carboxylcytosine (5-caC), thereby mediating DNA 
demethylation'*-”? (Fig. 3d, top panel). Methylation at the C5 position 
of cytosine normally represses transcription, and therefore, demeth- 
ylation is expected to activate gene expression. We interrogated the 
functional significance of the E1879Q mutation using plasmids encod- 
ing wild-type TET2 or this TET2 variant that were transfected into 
HEK293T cells (Extended Data Fig. 5a-c). Overexpression of wild-type 
or mutant TET2 proteins was verified by western blotting (Extended 
Data Fig. 5b, c). Analysis of genomic DNA isolated from these 
cells using both dot blotting (Extended Data Fig. 5a, b) and liquid 
chromatography-tandem mass spectrometry (LC-MS/MS) (Extended 
Data Fig. 5d, e) revealed that E1879Q compromises the step-wise 
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Fig. 4 | Effect of TET2 alteration on the epigenetic landscape of CAR 
T cells. a, Total 5-hmC levels in CAR* and CAR” T cells cultured from 
Patient-10. Histograms depict the intensity of intracellular 5-hmC 
staining. b, Genome browser views of ATAC enrichment at the IFNG 
locus of T cells from Patient-10. c, Frequencies of IFNy- and CD107a- 
expressing T cells expanded from Patient-10, unstimulated or stimulated 
with anti-CD3/CD28 antibodies. Insets indicate frequencies of gated cell 
populations. d, Longitudinal differentiation phenotypes of CAR” (left) 
and CAR™ (middle) CD8* T cells from Patient-10. Black arrows denote 
pre-infusion CAR T cells. Differentiation phenotype at the peak of in 
vivo activity is shown in two patients with CLL who showed long-term 
complete responses (Patient-1 and Patient-2) compared to Patient-10 
(right). Pie slices represent T cell subset frequencies. The CTL019 cell level 


oxidation of 5-mC relative to wild-type TET2, with reductions in 5-fC 
and 5-caC occurring (Fig. 3d). Therefore, the clonal expansion of CAR 
T cells in Patient-10 was comprised of a compound heterozygous 
loss-of-function in TET2 on one allele and a hypomorphic E1879Q 
variant on the other allele. 

CAR*V85.1* T cells from Patient-10 exhibited lower total levels of 
5-hmC compared to their CAR~V5.1~ T cell counterparts (Fig. 4a). 
This was presumably the result of disruption of the wild-type TET2 
allele following lentiviral integration, as the TET2 E1879Q variant can 
form 5-hmC (Fig. 3d, Extended Data Fig. 5a, b). To investigate the 
effects of TET2 alteration on CAR T cell fate and function, we per- 
formed assay for transposase-accessible chromatin using sequencing 
(ATAC-seq), which monitors DNA accessibility (Supplementary 
Table 3). The global epigenetic changes between CAR* and CAR™ 
T cells from Patient-10 were modest (Extended Data Fig. 6a). However, 
we found that genes with more accessible chromatin in CAR* com- 
pared to CAR” T cells were enriched in pathways that regulate the 
cell cycle and T cell receptor signalling (Extended Data Fig. 6b, 
Supplementary Tables 4, 5a). ATAC-seq peaks that were reduced or lost 
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Day 
and frequencies of activated CAR T cells expressing HLA-DR (surface 
molecule indicative of T cell activation) at the peak of each patient’s 
response are listed below the pie charts. e, TET2 expression in healthy 
donor CD8* T cells transduced with a scrambled shRNA (control) or 
TET2 shRNA. Error bars depict s.e.m. f, Frequencies of central memory 
(left), effector memory (middle) and effector (right) CD8* T cells 
following shRNA-mediated knockdown of TET2 (n = 12; pooled results 
from 4 independent experiments). g, Proliferation of healthy donor 
CTLO19 cells (n = 8; 3 independent experiments) in response to repetitive 
stimulation (denoted by black arrows) with K562 cells expressing CD19 or 
mesothelin (negative control). CAR T cells were transduced to express either 
a scrambled control or TET2-specific shRNA. P values were determined 
using a two-tailed, paired Student’s t-test. All error bars depict s.e.m. 


in the setting of TET2 biallelic dysfunction included those correspond- 
ing to several regulators of T cell effector differentiation or exhaustion 
such as IFNG, NOTCH2, CD28, ICOS and PRDM1 (Fig. 4b, Extended 
Data Fig. 6c, Supplementary Tables 4, 5b). Furthermore, transcription 
factor motifs within sites that changed accessibility could have affected 
specific transcriptional circuits controlling CAR T cell fate and anti- 
tumour activity in this patient (Extended Data Fig. 6d, Supplementary 
Table 6a, b). Functional analysis of CAR* T cells with biallelically 
altered TET2 cultured from this patient showed a diminished capacity 
to express IFNy and CD107a (a degranulation marker) when activated 
(Fig. 4c), consistent with a less differentiated state. Thus, lentiviral inte- 
gration into TET2 together with a hypomorphic mutation on the sec- 
ond allele reprogrammed the epigenetic landscape of CAR T cells ina 
manner that was consistent with altered T lymphocyte differentiation. 

We next analysed the differentiation state of ex vivo CTLO19 cells 
from Patient-10 and compared it to those of CAR T cells from six 
other patients who responded to this therapy (Extended Data Fig. 7), 
including two subjects with CLL (Patient-1 and Patient-2) who had 
long-term durable remissions (of more than 6 years) and did not have 
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TET2 integrations (data not shown). At the peak of in vivo expan- 
sion and activation marker expression, 65% of the CAR T cells in 
Patient-10 had a central memory phenotype (Fig. 4d, Extended Data 
Fig. 7a) which differed from other responders whose repertoires were 
dominated by CD8* effector memory and effector CTLO19 cells at the 
height of the response (Fig. 4d, Extended Data Fig. 7b). Experimental 
knockdown of TET2 (Fig. 4e) recapitulated the effect of its dysfunction 
in Patient-10 on the differentiation state of both total and CAR*CD8* 
as well as CD47 primary T cells (Fig. 4f, Extended Data Fig. 8a, b), 
implicating TET2 as an epigenetic modulator of human T lymphocyte 
fate. TET2-mediated regulation of CD8* T cell differentiation may not 
occur at the transcriptional level, as we did not observe differential 
TET2 mRNA expression between naive and memory subsets (Extended 
Data Fig. 8c). 

To investigate the effects of TET2 inhibition on CAR T cell function, 
we performed an in vitro serial re-stimulation assay. Repeated stimu- 
lation with CD19-expressing tumour cells enabled TET2 knockdown 
CAR T cells to continue to expand in an antigen-dependent manner, 
whereas re-stimulation of CAR T cells with unaltered TET2 resulted in 
arrest of culture growth (Fig. 4g) without affecting viability (Extended 
Data Fig. 8d). We next examined cytokine production following acute 
(Extended Data Fig. 9a) and chronic (Extended Data Fig. 9b, c) anti- 
gen stimulation. Consistent with our analysis of CD8tCAR* T cells in 
Patient-10, IFN production following activation of CD3 and CD28 
was diminished in CD8* as well as CD4* T cells with reduced TET2 
levels (Extended Data Fig. 9a). A similar decrease was observed for 
TNFa generation (Extended Data Fig. 9a). By contrast, acute produc- 
tion of both TNFa and IL-2 by CD4* T cells was increased upon CAR- 
specific stimulation (Extended Data Fig. 9a). While repeated exposure 
of bulk CTLO19 cells to CD19-expressing tumours also led to decreased 
IFN‘ elaboration (Extended Data Fig. 9b, c), TET2 inhibition resulted 
in the sustained production of various other cytokines following 
multiple rounds of stimulation (Extended Data Fig. 9b). Thus, TET2 
may control human T cell subset-specific cytokine production in an 
antigen-receptor-dependent and/or co-stimulatory signal-dependent 
fashion. 

On the basis of our evaluation of CAR* T cells expanded from 
Patient-10, we predicted that knockdown of TET2 would decrease 
effector molecule expression. Unexpectedly, CAR-specific stimula- 
tion, but not CD3 and CD28 stimulation, increased the expression of 
CD107a (Extended Data Fig. 10a). This may be attributed, at least in 
part, to enhanced cytolytic capacity mediated by 4-1BB over CD28 
co-stimulation due to NKG2D upregulation”’. Because CD8* T cell 
differentiation is accompanied by decreased methylation and upregu- 
lated gene expression at effector gene loci, including GZMB (encoding 
granzyme B) and IFNG”", we subsequently investigated whether TET2 
inhibition influences critical components of the cytotoxic machinery. 
In contrast to IFNy, TET2 reduction in CD8* CAR* T cells increased 
granzyme B and perforin expression (Extended Data Fig. 10b). These 
changes were associated with the heightened cytotoxic activity of TET2 
knockdown CAR T cells (Extended Data Fig. 10c). 

The above findings suggest that TET2 dysfunction may produce 
potent CAR T cells with properties of short-lived memory cells that 
can expand and elicit effector responses, as well as long-lived, persistent 
memory cells. We thus examined additional effector or memory mark- 
ers in CD8*CAR* and CAR T cells using post-infusion samples from 
Patient-10 and other long-term responding patients with CLL. At the 
height of the response, tumour-reactive CAR* T cells from Patient-10 
possessed higher levels of granzyme B (Extended Data Fig. 11a) and 
eomesodermin (Eomes; a transcription factor involved in the formation 
and maintenance of the CD8* memory T cell pool; Extended Data 
Fig. 11b) compared to matched CAR‘ T cells, unlike the other complete 
responders. All clinically active CD8* CAR* T cells in these patients 
expressed CD27, a co-stimulatory receptor involved in the generation 
of T cell memory (Extended Data Fig. 11b). The frequency of CTL019 
cells expressing KLRG1, a marker of T lymphocyte senescence that is 
known to be regulated by DNA methylation”, was significantly lower 
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on CAR* T cells from Patient-10 than on cells from other patients 
(Extended Data Fig. 11b). A high frequency of Ki-67-positive CAR* 
T cells was observed at the peak of in vivo expansion in Patient-10 
(Extended Data Fig. 11a), further suggesting that TET2 is required for 
CAR-specific CD8* T cell proliferation. These observations collectively 
support the idea that TET2 dysfunction promotes the development 
of human memory CAR T cells that can elicit effective anti-tumour 
responses. The remission observed in Patient-10 was likely to result 
from the marked increase in the number of CTLO19 cells with TET2 
dysfunction, despite the reduction in certain effector functions. 

In summary, profound clonal expansion of a single CAR-transduced 
T cell with biallelic TET2 dysfunction transformed a non-curative 
response into a deep molecular remission in a seventy-eight-year-old 
patient with CLL. Characterization of this T lymphocyte population 
revealed that changes to the epigenetic environment altered the dif- 
ferentiation state and proliferative capacity of the cells and translated 
into a considerable therapeutic effect. Although our initial studies were 
based on an extensive analysis of one subject, recapitulation of the effect 
of TET2 dysfunction on CAR T cell fate and anti-tumour activity in 
relevant culture systems involving primary human T-lymphocytes sup- 
ports the discovery of a modifiable epigenetic pathway that can shape 
the immune response. Thus, targeting the epigenome may improve 
the efficacy and persistence of CAR T cells. Finally, our results indi- 
cate that the progeny of a single CAR T cell are sufficient to mediate 
potent anti-tumour effects in advanced leukaemia, a finding that has 
substantial clinical implications regarding the delivery of bespoke cel- 
lular therapies. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
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METHODS 


Patient samples. Patients were enrolled in institutional review board (IRB)- 
approved clinical protocol: ‘Genetically engineered lymphocyte therapy in treating 
patients with B cell leukemia or lymphoma that is resistant or refractory to chemo- 
therapy’ (Clinical Trials.gov number: NCT01029366) which was designed to evalu- 
ate the safety and efficacy of the adoptive transfer of autologous T cells expressing 
CD19 chimaeric antigen receptors (CAR19) that incorporate TCR-¢ and 4-1BB 
costimulatory domains (CTLO19). Participants provided written informed consent 
in accordance with the Declaration of Helsinki and the International Conference 
on Harmonization Guidelines for Good Clinical Practice. All regulations were 
followed according to this United States (US) Food and Drug Administration 
(FDA)-approved clinical protocol. The current study is a secondary investigation 
using patient samples collected from existing clinical trials. Therefore, the sample 
sizes in this report were determined by the original clinical trial designs and sample 
availability; no additional inclusion/exclusion criteria were applied. 

Cell lines. NALM-6, K562 and HEK293T cell lines were originally obtained from 
the American Type Culture Collection (ATCC). OSU-CLL cells” were obtained 
from Ohio State University. Cells were expanded in RPMI medium containing 
10% fetal bovine serum (FBS), penicillin and streptomycin at a low passage and 
tested for mycoplasma using the MycoAlert detection kit as per the manufac- 
turer’s (Lonza) instructions. Authentication of cell lines was performed by the 
University of Arizona (USA) Genetics Core based on criteria established by the 
International Cell Line Authentication Committee. Short tandem repeat (STR) 
profiling revealed that these cell lines were well above the 80% match threshold. 
NALM-6 and OSU-CLL cells were engineered to constitutively express click beetle 
green (CBG) luciferase/enhanced GFP (eGFP). K562 cells were transduced with a 
lentiviral vector encoding human CD19 (K562-CD19), and negative control cells 
were K562 cells expressing mesothelin (K562-mesothelin)”*. Following transgene 
introduction, cells were sorted on a FACSAria (BD) to obtain a >99% pure pop- 
ulation. Mycoplasma and authentication testing were routinely performed before 
and after molecular engineering. 

CAR T cell manufacturing and correlative studies. Peripheral blood T cells 
for CTLO19 cell manufacturing were obtained by leukapheresis as previously 
described'. The processing, flow cytometric evaluation, quantification of 
cytokines and quantitative PCR analyses (that is, detection of CAR19 and the 
TCRV£5.1+ dominant clone) on pre- and post-CTLO19 infusion samples were 
conducted as previously reported”*. Next-generation sequencing of immuno- 
globulin heavy chain (IGH) rearrangements was carried out on DNA isolated 
from blood and marrow samples. In brief, primers specific for the variable and 
joining gene segments of the third complementarity-determining region of the 
IGH were used for amplification and deep sequencing to identify the leukaemic 
clone relative to baseline samples (Adaptive Biotechnologies). The frequency of 
the leukaemic clone in each sample was calculated using the number of total and 
unique productive reads. 

Population doublings of clonal CAR T cells in the blood (assuming a 5-litre 
volume of peripheral blood) were calculated using the equation A;= Ao2", in which 
nis the number of population doublings, Ao is the input number of cells (assuming 
asingle TET2-disrupted cell), and A; is the number of CAR T cells at day 121 (peak 
CAR T cell expansion; 95.445 CD3* CD8* CAR‘ cells per microlitre whole blood). 

These correlative assays were carried out at time points defined by the clinical 
protocol in parallel with disease response evaluations. This clinical trial was a 
single-treatment study; comparisons between patients in the current study were 
defined by the observed clinical responses. Investigators were blinded to clinical 
responses as correlative assays were conducted using de-identified subject samples. 
Flow cytometry. Routine assessments of CAR T cell expansion and persistence as 
well as measurement of B-CLL burden in the blood and marrow were conducted 
according to our previously published methods using a six-parameter Accuri C6 
flow cytometer (BD)!. T cell immunophenotyping was performed by surface 
staining with flow cytometry antibodies immediately following pre-incubation 
with Aqua Blue dead cell exclusion dye (Invitrogen). The Alexa Fluor 647- 
conjugated monoclonal antibody that was used to detect the CAR molecule 
has previously been described”’. Commercially available flow cytometry antibodies 
against the following antigens used in the study are as follows: CD3 allophycocya- 
nin (APC) H7, CCR7 PE/CF594, CD107a APC, (BD Biosciences); CD45RO bril- 
liant violet (BV)570, CD8 BV650, CD4 BV785, perforin BV421, Ki-67 Alexa Fluor 
(AF)700, TNFa BV605 (Biolegend); TCRVB5.1 APC, IFNy PE, IL-2 PerCP-eFluor 
710, Eomes FITC (eBioscience); Granzyme B PE/cyanine5.5 (Invitrogen). For 
intracellular staining, cells were fixed and permeabilized using the Foxp3 Fixation/ 
Permeabilization Kit (eBioscience) or the Cytofix/Cytoperm Kit (BD Biosciences). 
The GolgiStop protein transport inhibitor containing monensin and GolgiPlug 
protein transport inhibitor containing brefeldin A (BD Biosciences) were used 
when staining for intracellular cytokine production. All flow cytometry reagents 
were titrated before use. Samples were acquired on an LSRFortessa (BD) and data 
were analysed using FlowJo software (TreeStar). 
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TCRV® deep sequencing. Genomic DNA from pre-infusion T cells, peripheral 
blood samples or sorted post-infusion T cells was isolated using the DNeasy Blood 
and Tissue Kit (Qiagen). TCRV$ deep sequencing was carried out by immunoSEQ 
(Adaptive Biotechnologies). Only productive TCR rearrangements were used in 
the assessment of TCR clonotype frequencies. 

Integration site analysis. Vector integration sites were detected from genomic 
DNA as described previously**°. Genomic sequences were aligned to the human 
genome by BLAT (hg38, version 35, >95% identity) and statistical methods for 
analysing integration site distributions were carried out as previously described*?. 
The SonicAbundance method was used to infer the abundance of cell clones from 
integration site data”*. All samples were analysed independently in quadruplicate 
to suppress founder effects in the PCR and stochastics of sampling. 

Detection of TET2 chimaeric transcripts. RNA was isolated from cells and used 
as template with the One-Step RT-PCR Kit (Qiagen). Primers were designed to 
target the exon 9 and 10 boundaries of TET2, flanking the vector integration site 
and sequences internal to the anti-CD19BBC CAR lentiviral vector. These included 
various regions of the vector sequence (Extended Data Fig. 4a). Reactions were 
carried out as per the manufacturer's specifications. Thermocycling temperatures 
and time for reverse transcription as well as PCR activation were conducted using 
the following cycling conditions: 30s at 94°C for melting, 30s at 57°C for primer 
annealing and 1.5 min at 72°C for primer extension (35 cycles). A final extension 
at 72°C was held for 10 min for each sample. PCR products were visualized on 
ethidium bromide agarose gels (1.5% by weight) via electrophoresis and ultraviolet 
imaging. 

Next-generation sequencing of post-infusion CAR T cell samples. CART 
and CAR™ CD8* T cells were purified from post-infusion PBMC samples cor- 
responding to the peak of in vivo expansion in Patient-10. T cells were sorted 
using a FACSAria (BD) and genomic DNA was isolated from these lymphocytes 
as described above. A custom targeted next-generation sequencing panel of 68 
genes associated with haematologic malignancies was then used (TruSeq Custom 
Amplicon, Illumina), and sequencing was carried out on an Illumina MiSeq 
(Illumina). A minimal mean depth of 2,110 reads was achieved for the specimens 
sequenced, with the assay and bioinformatics performed as previously described*!. 
The data presented are based on human reference sequence UCSC build hg 19 
(NCBI build 37.1). 

Identifying the TET2 allele hosting vector integration. A PCR assay was devel- 
oped to amplify the region of DNA (approximately 4kb) between the vector integra- 
tion and the locus of the c.5635G>C mutation. Primers were designed to anneal to 
the vector sequence (MKL-3: 5/-CTTAAGCCTCAATAAAGCTTGCCTTGAG-3’) 
and multiple locations downstream of the mutation, chr4:105,276,145 
(50 bp: 5’ GCTGGTAAAAGACGAGGGAGATCCTG-3’, 99 bp: 5’-GGCTT 
CCCAAAGAGCCAAGCCATG-3’, 120 bp: 5’-CACGGGCTTTTTCA 
GCCATTTTGGC-3’). Genomic DNA samples from sorted CAR* and CAR™ 
CD8* T cells corresponding to the peak of clonal expansion in Patient-10 were 
selected for amplification. PCR reactions were carried out with Long Amp Taq 
polymerase (New England BioLabs) and 100-400 ng of DNA isolated from 
samples, according to the manufacturer’s recommendations. Amplification was 
conducted as follows: 94°C for 305s, 30 cycles of (94°C for 30s, 60°C for 30s, and 
65°C for 3 min 20s), and a final extension of 65°C for 10 min. Amplified prod- 
ucts were separated by electrophoresis on a 1% ethidium bromide agarose gel and 
prominent bands of 4kb in size were isolated using the QlAquick Gel Extraction 
Kit (Qiagen). Isolated bands were ligated into pCR2.1 and cloned into TOP-10 
chemically competent cells using the TOPO TA Cloning Kit (Invitrogen). Purified 
plasmids were sequenced with M13 forward and reverse primers using standard 
Sanger technology. Sequencing results were aligned to the vector sequence and 
reference genome. 

Characterization of the TET2 E1879Q mutation. The previously character- 
ized and crystalized human TET2-CS variant (1129-1936 A1481-1843) with 
an N-terminal FLAG-tag was expressed using a pLEXm expression vector?*"?. 
The E1879Q mutation or mutation of the catalytic H1382Y and D1384A (HxD 
mutant) were generated by standard means. HEK293T cells were cultured in 
DMEM with GlutaMAX (Thermo Fisher Scientific) and 10% FBS (Sigma). Cells 
were transfected with wild-type (WT), mutant hTET2-CS or an empty pLEXm 
vector control using Lipofectamine 2000 (Thermo Fisher Scientific) according 
to the manufacturer's protocol. Medium was changed 24h after transfection, and 
cells were collected by trypsinization 48 h after transfection and resuspended in 
phosphate-buffered saline. Genomic DNA was isolated from four-fifths of the cells 
using the DNeasy Blood and Tissue Kit (Qiagen) and the remaining one-fifth of 
the cells were lysed using CytoBuster Protein Extraction Reagent (EMD Millipore) 
for western blot analysis. 

DNA blots for cytosine modifications were carried out according to estab- 
lished protocols**. Purified DNA from HEK293T cells was diluted to 15 ng/ul in 
Tris-EDTA (TE) buffer, pH 8.0 for twofold dilutions of each sample. One-quarter 
volume of 2M NaOH-50mM EDTA was added to each sample. The DNA was 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


denatured for 10 min at 95°C and transferred quickly to ice, followed by the addi- 
tion of 1:1 ice cold 2M ammonium acetate. Polyvinylidene difluoride (PVDF) 
membranes were cut to size, wet with MeOH and equilibrated in TE buffer and 
then assembled into the PR 648 Slot Blot Manifold (GE Healthcare Life Sciences). 
Each well was washed with 40011 TE drawn through with gentle vacuum, and 
genomic DNA was loaded, followed by another TE wash. Membranes were blocked 
for 2h in 5% milk-TBST, washed thrice with TBST, and blotted at 4°C overnight 
with primary antibodies against each modified cytosine (1:5,000 mouse anti-5-mC 
(Abcam); 1:10,000 rabbit anti-5-hmC (Active Motif); 1:5,000 rabbit anti-5-fC 
(Active Motif); 1:10,000 rabbit anti-5-caC (Active Motif)). Blots were then washed, 
incubated with a 1:2,000 dilution of a secondary horse anti-mouse-horseradish 
peroxidase (HRP; Cell Signaling Technology) or 1:5,000 goat anti-rabbit- HRP 
(Santa Cruz Biotechnology) for 2 h, washed and imaged using the Immobilon 
Western Chemiluminescent HRP Substrate (Millipore) and the Amersham Imager 
600 (GE Healthcare Life Sciences). 

For protein detection, clarified cell lysates were run on 8% sodium dodecyl 
sulphate polyacrylamide (SDS-PAGE) gels. Gels were transferred together onto a 
PVDF membrane using the iBlot 2 Gel Transfer Device (Thermo Fisher Scientific). 
Membranes were blocked for 2h at room temperature with 5% (w/v) milk in Tris- 
buffered saline with 0.1% (v/v) Tween-20 (TBST), washed three times with TBST 
and blotted either with primary 1:10,000 anti-FLAG M2 (Sigma) or 1:1,000 anti- 
Hsp90a/8 (Santa Cruz Biotechnology) antibodies at 4°C overnight. Following 
incubation, membranes were washed and blotted with a 1:5,000 dilution of sec- 
ondary goat anti-mouse-HRP antibodies (Santa Cruz Biotechnology) for 2h, 
washed and imaged with Immobilon Western Chemiluminescent HRP Substrate 
(Millipore) on an Amersham Imager 600 (GE Healthcare Life Sciences). 

For LC-MS/MS, 1-2 1g of genomic DNA from each sample was degraded 
to component nucleosides with 1 U DNA Degradase Plus (Zymo Research 
Corporation) at 37°C overnight. The nucleoside mixture was diluted tenfold into 
0.1% formic acid, and injected onto an Agilent 1200 Series HPLC with a 51m, 
2.1 x 250mm Supelcosil LC-18-S analytical column (Sigma) equilibrated to 45°C 
in buffer A (5mM ammonium formate, pH 4.0). The nucleosides were separated 
in a gradient of 0-15% buffer B (4mM ammonium formate, pH 4.0, 20% (v/v) 
methanol) over 8 min at a flow rate of 0.5 ml per minute. Tandem MS was per- 
formed by positive ion mode ESI on a 6460 triple-quadrupole mass spectrometer 
(Agilent) with a gas temperature of 250°C, a gas flow of 121/min, a nebulizer 
pressure of 35 psi, a sheath gas temperature of 300°C, a sheath gas flow of 
111/min, a capillary voltage of 3,500 V, a fragmentor voltage of 70 V and a delta EMV 
of +1,000 V. Collision energies were optimized to 10 V for 5-mC and 5-fC; 15 V for 
5-caC; and 25 V for 5-hmC. Multiple reaction monitoring (MRM) mass transitions 
were 5-mC 242.11—126.066 m/z; 5-hmC 258.11 124.051; 5-fC 256.09— 140.046; 
5-caC 272.09 156.041; and T 243.10—+127.050. Standard curves were generated 
using standard nucleosides (Berry & Associates) ranging from 2.5 1M to 610pM 
(12.5 pmol to 3 fmol total). Digested oligonucleotides containing equimolar 
amounts of each modified cytosine were used as quality control samples. The 
sample peak areas were fit to the standard curve, as adjusted by the quality control 
samples to determine the amount of each modified cytosine in the genomic DNA 
sample. Amounts are expressed as the percentage of total cytosine modifications. 
Measurement of total 5-hydroxymethylcytosine levels. CD8* T cells were puri- 
fied from post-infusion PBMC samples using the EasySep Human CD8* T cell 
Immunomagnetic Negative Selection Kit (StemCell Technologies) and expanded 
ex vivo using a previously reported rapid expansion protocol’. Following culture, 
CD8*tCAR*TTCRV$5.1* and CD8tCAR~TCRV$5.1~ T cells were sorted on a 
FACSAria (BD). Cells were permeabilized and treated with 300 |1g/ml DNase I 
for 60 min at 37°C. After washing, samples were incubated with an anti-5-hmC 
monoclonal antibody or an isotype control for 30 min, followed by staining with an 
Alexa Fluor 647-conjugated secondary antibody, as previously described**. Cells 
were immediately acquired on an LSRFortessa (BD). 

Global chromatin profiling by ATAC-seq. Following culture, 
CD8*+CAR*TCRV85.1* and CD8*CAR-TCRV£5.1~ T cells were sorted on a 
FACSAria (BD). ATAC-seq was carried out as previously described*”**. Two rep- 
licates were performed for each ex vivo expanded CD8+ CAR*TCRV$5.1* and 
CD8*CAR-TCRV£85.1~ T cell culture. In brief, nuclei were isolated from 200,000 
sorted CD8* T cells for each replicate, followed by the transposition reaction in 
the presence of Tn5 transposase (Illumina) for 45 min at 37°C. Purification of 
transposed DNA was subsequently completed with the MinElute Kit (Qiagen) 
and fragments were barcoded with dual indexes (Illumina Nextra). Paired-end 
sequencing (2 x 75-bp reads) was carried out using the Illumina NextSeq 500. 
Raw sequencing data were processed and aligned to the GRC37/hg19 reference 
genome using Bowtie 2 and regions of significant enrichment were identified using 
MACS v1.4.2. Merged peak lists were created using BedTools. ATAC sequencing 
tag enrichment and DNA motif analysis across the merged peak list were carried 
out using HOMER (http://homer.salk.edu). Gene Ontology pathway analysis was 


performed with Metascape (http://metascape.org). Only high-confidence peaks 
were used for gene ontology and DNA motif analyses. For these evaluations, peaks 
with an enrichment score less than 5 were filtered out as previously established’. 
CAR T cell differentiation and expansion potency assays. Bulk primary human 
T cells were activated with paramagnetic beads coated with anti-CD3 and anti-CD28 
monoclonal antibodies as previously described” and transduced with lentiviral 
vectors encoding the CD19BBC CAR and shRNA hairpin sequences targeting TET2 
or a scrambled control with GFP co-expression (Cellecta). Knockdown efficiency 
in CTLO19 cells following shRNA transduction was determined by real-time quan- 
titative PCR with Taqman gene expression assays (Applied Biosystems) for TET2 
(assay Hs00325999_m1) and GAPDH (assay (Hs03929097_g1), which served as a 
loading and normalization control. Following 14 days of culture, the differentiation 
phenotype of these cells was determined by flow cytometry. GFP*CAR* T cells 
were sorted on a FACSAria (BD) and combined 1:1 with irradiated K562 cells engi- 
neered to express CD19* or mesothelin as a negative control. CTLO19 cells were 
serially re-stimulated with irradiated K562 targets three times, with absolute counts 
and viability assessments taken at regular intervals over 17 days. Cell counts and 
viability measurements were obtained using the LUNA Automated Cell Counter 
(Logos Biosystems). Supernatants were collected 24h after each stimulation for 
longitudinal measurements of cytokine levels in cultures, as previously described“! 
Intracellular cytokine, perforin and granzyme B analysis. CD8* T cells from 
Patient-10 were stimulated 3:1 with paramagnetic beads coated with anti-CD3 and 
anti-CD28 monoclonal antibodies for 6h in the presence of CD107a monoclonal 
antibody and the golgi inhibitors brefeldin A and monensin. Cells were washed 
and stained with live/dead viability dye, followed by surface staining for CD3, 
CD8 and TCRV85.1. These lymphocytes were subsequently fixed/permeabilized 
and intracellularly stained for IFNy. CAR T cells generated from healthy donors 
(TET2 knockdown or control) were stimulated in the same way with CD3/CD28 
beads or beads coated with an anti-idiotypic antibody against CAR19. Cells were 
then stained for surface antigens (CD3, CD4, CD8 and CAR19). After fixation and 
permeabilization, intracellular staining for IFNy, TNFa and IL-2 was performed. 
For perforin and granzyme B analysis, CTLO19 cells that had been transduced 
with a TET2 or scrambled control shRNA were expanded for 14 days and cryopre- 
served. These CAR T cells were then thawed and rested for 4h, followed by live/ 
dead and surface staining for CD3, CD8 and CAR19. Intracellular staining for 
perforin and granzyme was carried out following fixation and permeabilization. 
Cells were analysed on an LSRFortessa (BD). 
Cytotoxicity assay. Healthy donor CTLO19 cells transduced with a shRNA directed 
against TET2 or a scrambled control were co-cultured with CBG luciferase-expressing 
NALM-6 and OSU-CLL cell lines at the indicated ratios for 16 h*". Cell extracts 
were created using the Bright-Glo Luciferase Assay System (Promega Corporation) 
and substrate was added according to the manufacturer’ instructions. Luciferase 
measurements were taken on a SpectraMax luminescence microplate reader 
(Molecular Devices), and specific lysis was calculated as previously described*". 
Analysis of TET2 gene expression levels in T cell subsets. TET2 expression levels 
were determined by analysing a published gene expression dataset (https://www. 
ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE23321) for CD8* T cell subsets 
(naive, stem cell memory, central memory and effector memory) isolated from 
three healthy human subjects**. Genechip (Affymetrix) data were processed with 
the Bioconductor Oligo software package* (release 3.6, Bioconductor) using the 
RMA method“. 
Statistical analyses. Normality was assessed for all data using the D’Agostino-— 
Pearson omnibus test. For integration site data analysis, genomic feature data com- 
parisons were carried out as previously described using \’, Fisher’s exact tests, or a 
combination of Bayesian model averaging, conditional logit and regression’”*>**. 
Assessments of T cell differentiation and function in shRNA-mediated TET2 
knockdown experiments were performed using a paired Student’s t-test. Estimates 
of variation within each group of data are presented as error bars. Analyses were 
performed with SAS (SAS Institute), Stata 13.0 (StataCorp) or GraphPad Prism 6 
(GraphPad Software). Cytokine heat maps were constructed using Morpheus soft- 
ware (Broad Institute; accessed at https://software.broadinstitute.org/morpheus/). 
All tests were two-sided. Exact P values are reported. P< 0.05 was considered 
statistically significant. 
Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 
Data availability. Sequencing data are available at the NCBI Sequence Read 
Archive (accession number SRP 136348; analyses of lentiviral integration sites, 
TET2 chimaeric transcripts and alleles as well as mutations associated with 
hematologic malignancies), Adaptive Biotechnologies’ immuneACCESS database 
(https://doi.org/10.21417/B72D1Q; TCRV and BCR IgH immunosequencing) 
and Gene Expression Omnibus (accession number GSE112494; ATAC-seq). Any 
other data that support the findings of the study will be made available upon rea- 
sonable request. 
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Extended Data Fig. 1 | Timeline of disease clearance by CAR T cells in results of bone marrow assessments, tumour cytogenetics and CAR T cell 
Patient-10. An outline of clinical findings in Patient-10, including the persistence. CTLO19 cell infusion time points are indicated by red arrows. 
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Extended Data Fig. 2 | CAR T cell detection and profiling of immune 
cell populations in Patient-10 and other responders. a, Pre- and post- 
infusion kinetics of CAR T cell expansion (CD3*, CD8* and CD8_) 

are shown in Patient-10 compared to other responders. The number of 
circulating CTLO19 cells was calculated based on frequencies of CD3*, 
CD8* and CD8~ CAR T cell populations and absolute cell counts. 

b, Flow cytometry gating strategy to identify peripheral blood CAR T cells 
in Patient-10. c, Relative percentages of CTLO19 cells in the CD4 and 

CD8 compartments of this patient. T cells from a healthy subject served 
as a negative control. d, The persistence of CAR T cells in the peripheral 
blood of Patient-10 was determined by qPCR. The average threshold cycle 


(Ct) value obtained from three replicates and a standard deviation (SD) 

is listed. Calculations of CAR T cell abundance are reported as average 
marking per cell as well as transgene copies per microgram of genomic 
DNA. e, Frequencies of circulating B cells in Patient-10 compared to a 
healthy subject. Pre-gating was performed to exclude dead cells as well 

as doublets, and all gating thresholds were based on fluorescence minus 
one (FMO) controls (representative of two independent experiments). 

f, Enumeration of various immune cell populations in the blood of 
Patient-10. The frequency of each population is listed in a separate column 
that corresponds to its phenotypic marker. 
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Extended Data Fig. 3 | Clonal composition of CAR T cells from 
Patient-10 and other subjects treated with CTL019. a, TCRVB 
distribution in CD8~ (left) and CD8* (right) CAR T cells in the cellular 
infusion product of Patient-10. b, Relative frequencies of CAR T cell 
clones in three patients who had complete responses to CTLO19 therapy, 
summarized as stacked bar graphs. Each colour (horizontal bar) denotes a 


major cell clone, as marked by lentiviral integration sites. c, Integration 
site analysis in CAR T cells from two partially responding patients. 

d, In vivo expansion of CAR T cells in the above patients as determined 
by quantifying the average CAR transgene copies per microgram DNA at 
each time point. 
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R4 Vector EF1a promoter Reverse CAAGGGCCATAACCCGTAAA 
R5 Vector 4-1BB region (CAR) Reverse GCCATCTTCCTCTTGAGTAGTT 
R2 Vector LTR poly(A) signal Reverse GGCAAGCTTTATTGAGGCTT 
F2 Vector cPPT element Forward AGGAATTTGGCATTCCCTACA 
F3 Vector EF1a promoter Forward GGAGAACCGTATATAAGTGCAGTAG 
F4 Vector 4-1BB region (CAR) Forward CCTTCTCCTGTCACTGGTTATC 
F5 Vector R element Forward GAAGGGCTAATTCACTCCCAA 
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Extended Data Fig. 4 | Detection of TET2 chimaeric transcripts in 
Patient-10 CAR T cells and DNA sequencing for mutation detection. 
a, The strategy for detection of polyadenylated RNA corresponding to 
truncated TET2 transcripts is depicted. Boxes represent the genomic 
regions between TET2 exons 9 and 10 with the integrated vector present. 
Blue and red arrows indicate general locations of the forward and reverse 
primers, which are listed below the diagram. LTR, long terminal repeat; 
cPPT, polypurine tract; EFla, elongation factor 1-a promoter. Sequences 
corresponding to the splice junctions for the three chimaeric messages 
(five total junctions) are listed in the bottom chart. Underlines indicate 
consensus splice donors and acceptors. b, Visualization of chimaeric TET2 
RT-PCR products. PCR products were separated on a native agarose 


Reverse Complement: A A A GC G TIGIA GCTGCA 
Wildtype Sequence: A A AGC G TIGIA GCTGCA 


gel and stained with ethidium bromide. Expected sizes of amplicons are 
listed above the gel. Truncated transcripts are highlighted by blue boxes. 

A key to the RT-PCR reactions is shown below the diagram. *Band size 
not determined (two independent experiments). c, Genes interrogated 

by the next generation sequencing panel used to analyse DNA isolated 
from CD8tCAR* T cells and CAR™ T cells in Patient-10 at the peak of his 
response. d, Sanger sequencing of specific amplifications corresponding to 
the allele that was disrupted by integration of the CAR lentivirus is shown. 
The mutation that was detected by next generation sequencing of total 
genomic DNA from CAR‘ T cells (Fig. 3c) is not present in the TET2 allele 
hosting the lentiviral integration site. 
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Extended Data Fig. 5 | See next page for caption. 
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Extended Data Fig. 5 | Analysis of DNA methylation variants from 
HEK293T cells overexpressing TET2. a, Depiction of sequential 
oxidations of 5-mC to 5-hmC, 5-fC and 5-caC catalysed by TET2 (top). 
Dot blots for 5-mC, 5-hmC, 5-fC and 5-caC in genomic DNA (gDNA) 
isolated from HEK293T cells transfected with the E1879Q TET2 mutant 
are shown. Assay controls include an empty vector, wild-type TET2 and a 
catalytically inactive (HxD) TET2 mutant (bottom left). A western 

blot using anti-FLAG antibody to detect hTET2 in the above cells is 

also shown. Hsp90a/8 was used as a loading control (bottom right). 
Brightness and contrast were adjusted evenly across blots. b, Original, 
uncropped DNA (left) and western (right) blots. A dilution series was 
used for semiquantitative analysis of DNA methylation variants. 
Representative results of three independent experiments are shown. 

c, Validation of anti-FLAG and anti-Hsp90a/B antibodies. Serial dilutions 
of lysates (0.008, 0.04, 0.2, 0.8, 4.0 1g pl~) obtained from HEK293T 

cells transfected with wild-type TET2. Gels were probed by western 


blot. Both chemiluminescence (left) and digital (right) images were 
captured, demonstrating that these antibodies exhibit specificity for 

the expected FLAG tag and Hsp90 based on molecular weight markers. 
d, 5-mC, 5-hmC, 5-fC and 5-caC nucleosides were analysed at fixed 
concentrations using LC-MS/MS to generate standard curves. The area 
under the curve (AUC) was calculated for each MS/MS fragment. In the 
case of 5-hmC, the slope was further adjusted because of quality control 
analysis of an equimolar mixture of oligonucleotides, each containing 

a single modification. e, Analysis of gDNA from individual biological 
replicates within each HEK293T cell group. The top chart lists the raw 
AUCs that were converted to relative amounts of modified cytosine 
(middle) according to their signal intensities from the respective standard 
curves. The percentage of each modified cytosine calculated for each 
sample is shown in the bottom chart. Results are from three independent 
experiments. 
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Extended Data Fig. 6 | See next page for caption. 
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Extended Data Fig. 6 | Global chromatin profiling of CAR* and 

CAR~ T cells from Patient-10. a, Venn diagrams of high-confidence 
differential ATAC-seq regions (left) and enrichment of those peaks in 
regions of the diagrams (right) in CAR* and CAR-CD8* T cells expanded 
from Patient-10 (two biological replicates analysed in two independent 
experiments). Boxes extend from the 25th to 75th percentiles, the middle 
line denotes the median and whiskers show minimum and maximum. 

b, Gene Ontology terms associated with chromatin regions that are 
significantly more open in CD8*CAR? T cells from Patient-10 compared 
to their matched CD8+CAR~ T cell counterparts. c, Ontology analysis for 
chromatin regions that are less accessible in CD8*CAR* T cells than in 
CD8*CAR~ T cells. d, Enrichment of transcription factor (TF) binding 


motifs in chromatin regions gained or lost in CARt compared to CAR™ 

T cells from Patient-10. Transcription factor motifs that were potentially 
more accessible in increased ATAC-seq peaks of CAR? T cells included 
E26 transformation-specific (ETS) (GABPa, ELF1, Elk4) and zinc finger 
(ZF) transcription factor (Sp1) binding sites that are known to be enriched 
in human CD8* T cells before differentiation occurs*’. Transcription 
factor motifs that were potentially less accessible owing to reduced ATAC- 
seq peaks in CAR* T cells from Patient 10 (NF-KB, IRF1, NFAT-AP1 and 
CTCF) are enriched in terminally differentiated effector and exhausted 

T cells and have known key roles in forming the epigenetic landscape that 
programs their biology*’. 
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Extended Data Fig. 7 | The differentiation state of CAR T cells in frequencies of the gated cell populations. b, Example gating strategy 
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contour plots of flow cytometric data depicting the frequency of CARt CAR‘ T cells from a complete responder (top left). Line graphs depict the 
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Extended Data Fig. 8 | Effect of TET2 expression on T cell 
differentiation and viability. a, Representative flow cytometry plots 
showing the differentiation state of healthy donor CD8*CAR* T cells after 
transduction with a scrambled shRNA (control) or shRNA targeting TET2. 
Insets define frequencies of gated populations. b, Frequencies of healthy 
subject CAR*CD8* (top) and CAR*CD4* (bottom) T cells according to 
differentiation phenotype following control or TET2 shRNA transduction 
(n= 10; pooled results from four independent experiments). P values were 
determined using a two-tailed, paired Student's t-test. c, Comparison of 
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the expression levels of TET2 in naive and memory CD8* T cell subsets 
from three healthy donors. Two variants encoding different isoforms have 
been identified for this gene in humans. Expression levels of each TET2 
variant were estimated by measuring the probe intensity from microarray 
analysis. d, Viability of CAR* T cells transduced with a TET2 shRNA or 
scrambled control and restimulated with K562 cells expressing CD19 
(n= 12; pooled results from three independent experiments). Each arrow 
indicates the time point at which CAR T cells were exposed to antigen. 
Error bars depict s.e.m. 
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Extended Data Fig. 9 | See next page for caption. 
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Extended Data Fig. 9 | CAR T cell cytokine profiles following TET2 
inhibition. a, Representative flow cytometry of acute intracellular 
cytokine production by healthy donor (n = 6; three independent 
experiments) CAR T cells transduced with a TET2 shRNA or a scrambled 
control shRNA (left). Production of IFNy, TNFa and IL-2 by total CD3*, 
CD4* and CD8* CAR T cells is shown. These cells were stimulated with 
beads coated with anti-CD3 and anti-CD28 antibodies (top right) or 
CAR anti-idiotypic antibodies (bottom right). Boxes represent the 25th to 
75th percentiles, the middle line denotes the median and whiskers depict 
minimum and maximum. b, Heat map and cluster analysis of cytokine 


profiles for CAR T cells transduced with a TET2 shRNA or scrambled 
control and serially restimulated with irradiated K562 cells expressing 
CD19 are shown. Colours represent scaled cytokine data corresponding 
to each stimulation time point. Hierarchal clustering was used to generate 
the cluster dendrogram and cytokine response groups. c, Production of 
IFNy (top), TNFa (middle) and IL-2 (bottom) by TET2 knockdown or 
control CAR T cells (1 = 6; three independent experiments) following 
restimulation with CD19 antigen. Black arrows indicate when CAR T cells 
were exposed to CD19-expressing K562 cells. Error bars denote s.e.m. All 
P values were determined using a two-tailed, paired Student's t-test. 
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Extended Data Fig. 10 | Effect of TET2 knockdown on the cytotoxic 
machinery of CAR T cells. a, Flow cytometry plots showing the frequency 
of TET2 knockdown or control CAR T cells expressing CD107a (a marker 
of cytolysis) following CD3 and CD28 or CAR-specific stimulation (left). 
Summarized data from analysis of CAR T cells manufactured from n= 6 
different healthy donors is shown (right). b, Representative histograms 
illustrating expression levels of granzyme B and perforin in CAR T cells in 
the setting of TET2 inhibition as compared to its counterpart control (left). 
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Pooled data from CAR T cells of n=5 healthy donors are summarized on 
the right. c, Cytotoxic capacity of CTLO19 cells (transduced with a TET2 
or scrambled control shRNA) after overnight co-culture with luciferase- 
expressing OSU-CLL (left) or NALM-6 (right) cells. Untransduced T cells 
were included as an additional group to control for non-specific lysis. 

P values were determined using a two-tailed, paired Student’s t-test. All 
data were pooled from three independent experiments. 
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Extended Data Fig. 11 | Effector and memory molecule expression by 
CAR T cells from Patient-10 compared to those from other responding 
subjects. a, Expression of granzyme B (left) and the frequency of CAR™ 
and CAR‘ T cells co-expressing granzyme B/Ki-67 (right panel) at the 
peak of in vivo CTL019 expansion in Patient-10 compared to three other 
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complete responders. b, Representative histograms of intracellular Eomes 
expression (left), and contour plots depicting frequencies of CD27- 
(middle) and KLRG1-expressing (right) lymphocytes in the same cell 
populations of these patients. These results are representative of three 
experiments repeated independently with comparable findings. 
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Phase-separation mechanism for C-terminal 
hyperphosphorylation of RNA polymerase II 


Huasong Lu!’, Dan Yu’, Anders S. Hansen’, Sourav Ganguly’, Rongdiao Liu!, Alec Heckert?, Xavier Darzacq? & Qiang Zhou?* 


Hyperphosphorylation of the C-terminal domain (CTD) of the 
RPB1 subunit of human RNA polymerase (Pol) II is essential for 
transcriptional elongation and mRNA processing'*. The CTD 
contains 52 heptapeptide repeats of the consensus sequence 
YSPTSPS. The highly repetitive nature and abundant possible 
phosphorylation sites of the CTD exert special constraints on 
the kinases that catalyse its hyperphosphorylation. Positive 
transcription elongation factor b (P-TEFb)—which consists of 
CDK9 and cyclin T1—is known to hyperphosphorylate the CTD 
and negative elongation factors to stimulate Pol II elongation!*. 
The sequence determinant on P-TEFb that facilitates this action 
is currently unknown. Here we identify a histidine-rich domain 
in cyclin T1 that promotes the hyperphosphorylation of the 
CTD and stimulation of transcription by CDK9. The histidine- 
rich domain markedly enhances the binding of P-TEFb to the 
CTD and functional engagement with target genes in cells. In 
addition to cyclin T1, at least one other kinase—DYRK1A°—also 
uses a histidine-rich domain to target and hyperphosphorylate 
the CTD. As a low-complexity domain, the histidine-rich 
domain also promotes the formation of phase-separated 
liquid droplets in vitro, and the localization of P-TEFb to 
nuclear speckles that display dynamic liquid properties and are 
sensitive to the disruption of weak hydrophobic interactions. 
The CTD—which in isolation does not phase separate, despite 
being a low-complexity domain—is trapped within the cyclin T1 
droplets, and this process is enhanced upon pre-phosphorylation 
by CDK7 of transcription initiation factor TFITH'?. By using 
multivalent interactions to create a phase-separated functional 
compartment, the histidine-rich domain in kinases targets the CTD 
into this environment to ensure hyperphosphorylation and efficient 
elongation of Pol II. 

Among all transcription-related cyclins, cyclin (CYC) T—which 
includes T1 (CYCT1) and T2—has the longest C-terminal regions. 
In CYCT1, the N-terminal region is structured and contains the cyc- 
lin-box repeats required for binding and activating CDK9, whereas the 
C-terminal region has only a few isolated motifs and is mostly unstruc- 
tured (Fig. la). 

To determine whether the CYCT1 C-terminal region is important 
for regulating CDK9 activity, we performed in vitro kinase reactions 
to examine affinity-purified CDK9-CYCT1-Flag heterodimers that 
contained various truncated forms of CYCT 1 with progressively short- 
ened C termini to phosphorylate the CTD (Fig. la and Extended Data 
Fig. 1a). A mixture of glutathione S-transferase (GST)-—CTDs2 (RPB1 
CTD containing all 52 repeats) and GST-CTDgy (CTD containing 9 
consensus repeats) was used in all reactions as the kinase substrates. 

The various CYCT1 truncations did not substantially affect the abil- 
ity of the associated CDK9 to autophosphorylate by producing the ATP- 
dependent mobility shift (Fig. 1a), nor did the truncations decrease the 
phosphorylation of CTD . However, upon truncation to a position at 
and beyond 533, CDK9 became largely unable to produce the hyper- 
phosphorylated CTDs> (hereafter, IIo) as revealed by the anti-phospho- 
Ser5 antibody 3E8 (Fig. 1a). A similar pattern was also detected with 


the anti-phospho-Ser2 antibody 3E10 (Extended Data Fig. 1b). Thus, 
a region around position 533 in CYCT1 promoted the hyperphospho- 
rylation by CDK9 of CTDs3. 

To determine whether the CYCT1 C-terminal truncations affect 
the transcriptional activity of P-TEFb, we used the Gal4-tethering sys- 
tem to test fusion proteins containing the Gal4 DNA-binding domain 
attached to the various truncated forms of CYCT1 to activate lucif- 
erase expression from the HIV-1 promoter containing Gal4-binding 
sites (Fig. 1b). Correlating with the kinase results, progressive CYCT1 
C-terminal truncations to and beyond position 533 markedly reduced 
P-TEFb transcriptional activity (Fig. 1b). 

The coiled-coil domain (CCD, amino acids 379-430) and the histi- 
dine-rich domain (HRD, amino acids 480-550) exist around position 
533 (Fig. 1a). In kinase reactions, only the deletion of HRD (AHRD) 
but not CCD (ACCD) blocked the hyperphosphorylation by CDK9 
of CTDs) (Fig. 1c). Consistently, HRD but not CCD was required for 
P-TEFb transcriptional activity in the Gal4-tethering assay (Fig. 1d). 
Dependence on HRD for Tat/P-TEFb-activated HIV-1 elongation has 
previously been proposed’, and here confirmed by our finding that 
wild-type CYCT1 but not CYCT1AHRD could effectively rescue the 
RNAi-knockdown of endogenous CYCT1 expression to produce long 
(promoter-distal) but not short (promoter-proximal) viral transcripts 
(Fig. le). 

The importance of the HRD for the transcriptional activity 
of P-TEFb was generalized to cellular genes, as CYCT1AHRD pro- 
duced significantly less mRNA from four representative immediate 
early genes—FOS, JUNB, MYC and EGR1—as well as HSP70-1 (also 
known as HSPA1A) under both basal and heat-shock conditions 
(Extended Data Fig. 2), all of which require P-TEFb for optimal tran- 
scription. Co-immunoprecipitation analysis (Fig. 1f and Extended 
Data Fig. 2e) shows that the decreased activity of CYCTIAHRD was 
not due to any substantial change in binding to the major P-TEFb part- 
ners, including BRD4 and subunits of the super elongation complex 
and the 7SK small nuclear ribonucleoprotein particle that regulate 
P-TEFb activity’**. 

What is the mechanism by which the HRD promotes the activity of 
P-TEFb? The first hint came from the finding that wild-type CYCT1 
in HeLa nuclei was more resistant to salt extraction, as compared to 
CYCT1AHRD (Fig. 2a), which suggests that the HRD promoted the 
retention of CYCT1 in the nucleus. To determine more precisely the 
location and dynamics of this retention, we performed fluorescence 
recovery after photobleaching (FRAP) analysis of bindings of Halo- 
tagged wild-type CYCT1 and CYCTIAHRD to a gene array acti- 
vated by reverse tetracycline-controlled transactivator (Fig. 2b). This 
array contains about 200 copies of an integrated transgene marked by 
YFP-Lac repressor bound to the lac operator, which enables FRAP 
to be performed at this spot®. We observed significantly faster FRAP 
recovery for CYCT1AHRD on the array than for wild-type CYCT1 
(Fig. 2c). Further quantitative analysis (Extended Data Fig. 3a, b and 
Supplementary Information) revealed that this was mainly due to 
longer apparent residence time of wild-type CYCT1 (Tofr~ 56s), as 
compared to CYCTIAHRD (Tof~ 4s). 


1S§chool of Pharmaceutical Sciences, Xiamen University, Xiamen, China. Department of Molecular and Cell Biology, University of California, Berkeley, CA, USA. *e-mail: qzhou@berkeley.edu 
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Fig. 1 | The CYCT1 HRD promotes hyperphosphorylation of Pol II 
CTD;, and activation of transcription, but not interaction with P-TEFb 
partners, by CDK9. a, c, P-TEFb containing the indicated CYCT1-Flag 
proteins (domain structure at bottom) were tested in kinase reactions 
containing GST-CTD;, and GST-CTDo as the substrates. Western 
blotting was performed to detect the phosphorylated products (p-CTDs2 
and p-CTD.) and P-TEFb components. Io and Ila, hyper- and hypo- 
phosphorylated CTD, respectively. *, a non-specific band. b, d, Plasmids 
expressing the indicated Gal4-CYCT1 fusions were co-transfected into 
HeLa cells with a HIV-1 LTR-luciferase reporter construct containing 

the Gal4 upstream activation sequences (UAS). Luciferase activities 
(mean +s.d., 1 =3) in cell extracts were measured and compared to the 
activity in the first lane (set to 1). Levels of indicated proteins in extracts 


To generalize the FRAP finding to endogenous genes (active and 
inactive) and cross-validate it using an orthogonal technique, we 
performed single-particle tracking in U2OS cells of Halo-tagged 
wild-type CYCT1 and CYCT1AHRD proteins at 95 Hz to deter- 
mine whether the HRD affects CYCT1 diffusion dynamics and 
bound fractions®"® (Fig. 2d, Extended Data Fig. 3c-e, Supplementary 
Video 1 and Supplementary Information). Consistent with the HRD- 
facilitated association of CYCT 1 with chromatin and the transcriptional 
machinery, CYCT1AHRD diffused faster and showed a smaller bound 
fraction (8.8%) than did wild-type CYCT1 (13.0%). Next, we con- 
ducted single-particle tracking using a longer exposure time (500 ms; 
Supplementary Video 2) to blur out fast-diffusing molecules and focus 
on bound molecules that were presumably located at predominantly 
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were examined by western blotting. e, HeLa-based CYCT1 knockdown 
cells containing an integrated HIV-1 provirus were transfected with the 
indicated Tat + CYCT1-expressing plasmids or a control vector. The 
indicated proteins were detected by western blotting. HIV-1 mRNA levels 
at the promoter-proximal ‘short’ and promoter-distal ‘long’ positions were 
analysed by qRT-PCR (mean + s.d., n = 3), with signals normalized to 
those in control cells (set to 1) and P value from two-tailed Student’s t-test. 
f, Nuclear extracts of HeLa cells expressing the indicated CYCT1-Flag 
proteins and anti-Flag immunoprecipitates from nuclear extracts were 
analysed by western blotting. All western blots are representative of three 
independent experiments. For gel source data, see Supplementary Fig. 1. 
aa, amino acids; FL, full length; TRM, Tat/TAR-recognition motif; PEST, 
proline, glutamic acid, serine and threonine-rich sequence; nt, nucleotides. 


activated genes. We observed a wide distribution of CYCT1 binding 
events that probably encompassed both specific and non-specific 
interactions (Fig. 2e). Because this distribution did not fit well with 
models that assumed one or two single rate-limiting steps, we could 
not reliably attribute a single residence time. Nevertheless, consistent 
with the FRAP result, wild-type CYCT1 showed binding events that 
were significantly more stable than those of CYCT1 AHRD (Fig. 2e). 
Together, the single-particle tracking and FRAP results underscore the 
key role of the HRD in promoting the binding of P-TEFb to activated 
genes to phosphorylate CTDsp. 

What could be the direct target of the HRD on activated 
genes? Consistent with a previous report that a CYCT1 mutant that 
lacked the HRD failed to bind the CTD’, wild-type GST-CYCT1 
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Fig. 2 | CYCT1 HRD contributes to the stable engagement of P-TEFb 
with activated gene array and endogenous genes, as well as direct 
binding to CTD by P-TEFb. a, Nuclei of HeLa cells expressing indicated 
CYCT1-Flag were extracted with increasing NaCl concentrations. 

The whole cell extract (WCE) and soluble fractions were examined by 
western blotting. b, Diagram of the gene array analysed by FRAP inc. Tet, 
tetracycline; rtTA, reverse tetracycline-controlled transactivator; TRE, 
tetracycline response element. The fluorescence images show the Halo- 
TMR and YFP emissions collected in separate channels. The small circle 
indicates the bleach spot. c, Fluorescence recovery plots of indicated Halo-— 
CYCT1 photobleached at the spot containing the activated gene array. The 
intensities were normalized to pre-bleach values and are shown at various 
time points after the bleach. Inferred residence times are also given. AU, 
arbitrary unit. d, Single-particle tracking (1-ms excitation pulse; 95 Hz) 

of Halo-tagged wild-type CYCT1 (21 cells) or CYCT1AHRD (21 cells) 
labelled with photoactivatable Janelia Fluor 549 in U2OS cells. Left, 


precipitated more Pol II from HeLa nuclear extracts than did 
GST-CYCTIAHRD (Fig. 2f). Moreover, GST-CTD pulled 
down P-TEFb containing wild-type CYCT1, but not CYCT1AHRD 
(Fig. 2g). Finally, a direct and HRD-dependent interaction was 
detected between a recombinant CYCT1 C-terminal fragment and 
mCherry-CTD in both the GST pulldown and glycerol gradient 
formats (Extended Data Fig. 8a, b). Thus, P-TEFb uses the HRD to 
directly target the CTD. 

In addition to P-TEFb, CDK7- CYCH in TFHH is also a CTD kinase 
for Pol II to clear the promoter during transcription initiation!. Unlike 
CYCT1, CYCH lacks a long C-terminal region and HRD. Compared 
to CDK9-CYCT 1-Flag, affinity-purified CDK7-CYCH-Flag pro- 
duced a markedly lower level of CTDs; Io in kinase reactions (Fig. 3a), 
which was confirmed with recombinant CAK (CDK7—CYCH-MAT1) 
and P-TEFb (Millipore) in a time- and dosage-dependent manner 
(Extended Data Fig. 4a, b). 

Appending a CYCT1 C-terminal fragment (amino acids 401-630) 
containing the HRD to CYCH markedly increased hyperphosphoryl- 
ation by CDK7 of CTDs) (Fig. 3b), and partially rescued the CYCT1 
knockdown to support HIV-1 transcription (Fig. 3c). However, when 
the HRD was deleted from the CYCH + CYCT1 chimaera, both the 
production of CTDs) IIo and the rescue of CYCT1 knockdown were 
mostly abolished (Fig. 3b, c), which indicates the importance of the 
HRD to these processes. Consistently, the chimaera, but not wild-type 
CYCH, was precipitated by GST-CTD (Fig. 3d). 


examples of raw images with trajectory overlaid in red. Middle, cumulative 
distribution function (CDF) of displacements at Av = 31.5 ms. Right, 
bound fraction inferred from three-state model fitting (mean + s.e.m., 
n= 3. P value from two-tailed Student’s t-test). e, Distribution of 
single-molecule binding times, uncorrected for photobleaching. Bound 
molecules (Janelia Fluor 646) were tracked using a long 500-ms exposure 
time. Plot shows 1 — CDF for trajectories captured for at least two frames. 
The constitutively bound histone H2B-Halo shows the photobleaching 
limit. Data from three independent replicates were merged and plotted. 

f, Immobilized GST or GST fusions were incubated with HeLa nuclear 
extract. The bound proteins were detected by western blotting (WB). 

g, Immobilized GST-CTDs2 was incubated with the indicated affinity- 
purified CYCT1-Flag. The bound proteins and 2.5% of input were 
analysed by western blotting. All western blots are representative of three 
independent experiments. For gel source data, see Supplementary Fig. 1. 


We next investigated whether other CTD kinases also use an 
HRD to target Pol II for hyperphosphorylation. We noticed that the 
kinase DYRKIA, a CTD kinase that controls transcription of selected 
growth-related genes°® and that is associated with Down syndrome, 
also contains an HRD (Fig. 3e). The rest of DYRK1A and CYCT1 
are non-homologous. In kinase reactions, wild-type DYRK1A and 
DYRK1AAHRD autophosphorylated to a similar extent, but only 
wild-type DYRK1A efficiently hyperphosphorylated CTD; (Fig. 3e). 
In a co-immunoprecipitation assay employing three different anti- 
RPBI1 antibodies, wild-type DYRKIA precipitated more Pol II than 
did DYRKIAAHRD (Fig. 3f and Extended Data Fig. 4c). The precipi- 
tation of DCAF7, which binds DYRK1A N terminus", was unaffected 
by DYRK1AAHRD. Thus, in addition to CYCT1, at least one other 
kinase also requires a functional HRD to efficiently bind and hyper- 
phosphorylate CTD;. 

The HRD is a low-complexity domain owing to the overrepresenta- 
tion of only a single amino acid. This domain, including a central 
cluster of multiple consecutive histidines, is highly conserved in verte- 
brate CYCT1 (Extended Data Fig. 5a). Using the prediction program 
TUPred”? (http://iupred.enzim.hu), we found that the homan CYCT1 
HRD displays the highest disorder tendency in a broader intrinsi- 
cally disordered region (IDR) that lacks well-defined structure! and 
overlaps with the CYCT1 C-terminal region (Extended Data Fig. 5b). 
Recently, the IDRs—especially those containing a low-complexity 
domain—have been shown to promote liquid-liquid phase separation, 
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Fig. 3 | Upon fusion with CYCT1 HRD, CYCH promotes CDK7 
hyperphosphorylation of Pol II CTD; DYRKI1A also contains a 
functional HRD to target and hyperphosphorylate CTD. 

a, b, e, The indicated anti-Flag immunoprecipitates (IP) were tested in 
kinase reactions as in Fig. 1a. c, The CYCT1 knockdown cells were 
co-transfected with HIV-1 LTR-luciferase reporter construct and plasmids 
expressing the indicated proteins. Luciferase activities were measured and 


which probably drives the formation of intracellular membrane-less 
organelles’ for compartmentalized biochemical reactions". 

In vitro, phase separation is reversible!? and influenced by several 
parameters (for example, temperature, ionic strength, post-translational 
modifications and so on), which enable liquid droplets to form upon 
reaching a certain threshold and to quickly dissemble when pushed in 
the opposite direction’®. To determine whether T1-IDR (amino acids 
462-654)—the longest, HRD-containing CYCT1 IDR—could phase 
separate in an HRD-dependent manner, we purified the GFP-T1-IDR 
and GFP-T1-IDRAHRD fusions from Escherichia coli (Extended Data 
Fig. 5c). At 150mM NaCl, the protein solutions remained translucent. 
When lowered to 37.5 mM, a typical concentration used to induce 
phase separation in vitro!®!’, the wild-type GFP-T1-IDR solution 
immediately turned opaque, whereas GFP-T1-IDRAHRD showed 
no change (Extended Data Fig. 5e). 

Under a microscope, wild-type GFP-T1-IDR spontaneously formed 
micrometre-sized, spherical droplets, whereas GFP-T1-IDRAHRD 
produced only a low level of irregular aggregates (Fig. 4a). Similarly, 
the GFP fusion that contained the longest IDR (amino acids 491-686) 
of DYRK1A also formed droplets in an HRD-dependent manner 
(Extended Data Fig. 6). Notably, the GFP-T1-IDR droplets quickly 
disappeared once NaCl returned to 150mM (Extended Data Fig. 5f). 
Furthermore, 1,6-hexanediol, a compound that is known to perturb 
weak hydrophobic interactions to disassemble structures that exhibit 
liquid-like properties!®'°, completely blocked droplet formation 
(Extended Data Fig. 5g). Finally, when nine histidines within the his- 
tidine cluster in CYCT1 HRD were changed to alanines (Extended Data 
Fig. 7a), the resulting GFP-T1-IDR(9A) mutant formed only tiny drop- 
lets (Fig. 4b), which indicates that the histidines are essential for phase 
separation. Functionally, the mutation decreased phosphorylation by 
P-TEFb of CTDs), and the transcriptional activity of P-TEFb (Extended 
Data Fig. 7c, d). 

In cells, both ectopically expressed CYCT1-Flag or eGFP-CYCT1 
and endogenous CYCT1 displayed a punctuated staining pattern inside 
the nuclei (Fig. 4c—-e and Extended Data Fig. 5h). This has previously 
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analysed as in Fig. 1b. Data are mean + s.d., n =3. d, Immobilized GST- 
CTDs2 was incubated with nuclear extract of HeLa cells expressing the 
indicated proteins. The bound proteins and 2.5% of input were analysed by 
western blotting. f, Nuclear extract of HeLa cells expressing the indicated 
proteins and anti-Flag immunoprecipitates from nuclear extract were 
analysed by western blotting. All western blots are representative of three 
independent experiments. For gel source data, see Supplementary Fig. 1. 


been attributed to the localization of P-TEFb to the nuclear speckles 
enriched with transcription and splicing factors”””!. Importantly, HRD 
and its histidine cluster were needed to target CYCT1 to the speck- 
les (Fig. 4c and Extended Data Figs. 5h, 7e), and the same has also 
been reported for DYRK1A”. Mirroring its disruption of the GFP- 
T1-IDR droplets in vitro, 1,6-hexanediol also quickly dissembled the 
CYCT1 nuclear speckles (Fig. 4d). Finally, when performing time-lapse 
phase-contrast imaging of cells expressing eGFP-CYCT1, we observed 
multiple cells in which CYCT1 speckles displayed at least one fusion 
event within the period of a few minutes, demonstrating their dynamic 
and liquid-like properties (Fig. 4e). 

Following previous findings”*4, we asked whether the HRD- 
containing IDR could recruit the CTD into phase-separated droplets, 
given their demonstrated direct interaction. Indeed, although the CTD 
itself is a low-complexity sequence, recombinant mCherry-CTD” 
(Extended Data Fig. 5d) alone did not phase separate, but it was readily 
incorporated into droplets when incubated together with GFP-T1-IDR 
(Fig. 4f). 

During transcription, the CTD is phosphorylated first 
by CAK in TFIIH and then by P-TEFb!*”. Pre-phosphorylation 
by CAK (Extended Data Fig. 8c) not only enhanced the incorpo- 
ration of mCherry-CTD into the GFP-T1-IDR droplets, but also 
promoted phase separation overall by producing bigger and brighter 
droplets (Fig. 4f). Consistently, the pre-phosphorylated GST- 
CTD precipitated more GFP-T1-IDR (Extended Data Fig. 8d). 
Underscoring the physiological relevance of these observations, 
hyperphosphorylated Pol Ilo is known to preferentially localize in 
the nuclear speckles”. 

Although both CTD hyperphosphorylation by CDK9 and HRD- 
mediated droplet formation were largely inhibited by 2.5% 1,6-hexan- 
ediol, the HRD-CTD binding (and CDK9-CYCT1 interaction) was not 
substantially inhibited until 1,6-hexanedil was at 10% (Fig. 4g-i). This 
important difference suggests that phase separation, which is caused by 
weak, multivalent interactions among the HRDs and is easily disrupted 
by 1,6-hexanediol, is critical for the hyperphosphorylation of the CTD. 
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Fig. 4 | The hyperphosphorylation of CTD by P-TEFb is promoted 

by CYCT1 IDR, which forms phase-separated droplets and/or 

speckles in an HRD-dependent manner and recruits the CTD into 
these compartments. a, b, Solutions containing the indicated proteins 

at 6mg ml! (a) or 2mg ml! (b) were examined with a microscope 

for fluorescence (GFP) or under white light (bright field). c, HeLa cells 
expressing wild-type CYCT1-Flag or CYCT1AHRD-Flag were examined 
by indirect immunofluorescence staining with anti-Flag monoclonal 
antibody. DNA was counterstained using DAPI. d, HeLa cells were treated 
with 10% 1,6-hexanediol (Hex.) or without1,6-hexandiol (control) for 

1 min and then analysed by immunofluorescence staining with anti- 
CYCT1 antibody. e, HeLa cells expressing eGFP-CYCT1 were examined 
by time-lapse phase-contrast imaging. Three separate cell nuclei are 
shown, in which CYCT1 speckles underwent spontaneous fusions as 
indicated by the arrows. f, Solutions containing GFP-T1-IDR at 3mg ml“! 
and/or phosphorylated (phos.) or unphosphorylated (unphos.) mCherry— 
CTD at 1.2 mg ml! were examined under a fluorescence microscope 

as in a. g, Kinase reactions as in Fig. 1a were performed in the presence 

of the indicated concentrations of 1,6-hexandiol. p-CTD;, was detected 
by western blotting. Levels of CYCT1-Flag and its bound CDK9 in each 
reaction are also shown. h, 1,6-Hexanediol was present at the indicated 
concentrations in solutions containing 6 mg ml! wild-type GFP-T1- 
IDR and 37.5mM NaCl, which were examined under a fluorescence 
microscope. i, Immobilized GST-CTD was incubated with recombinant 
GFP-T1-IDR in reactions containing the indicated concentrations of 
1,6-hexandiol. The bait and bound proteins were analysed by western 
blotting. All western blots are representative of three independent 
experiments. For gel source data, see Supplementary Fig. 1. 


On the other hand, the relatively drug-resistant HRD-CTD binding is 
probably key to the recruitment of the CTD to CDK9 in droplets, but 
is insufficient to establish the optimal environment for hyperphospho- 
rylation (Extended Data Fig. 9). 

In summary, our data indicate that in at least two kinases—CYCT1 
of P-TEFb and DYRK1A—the HRD, a low-complexity domain of 
previously unknown function, promotes the hyperphosphorylation 
of the CTD by targeting the CTD as well as inducing phase separa- 
tion in vitro and in cells. The phase-separated droplets and speckles 
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compartmentalize the kinase and substrate to enable highly efficient 
reactions, which results in the hyperphosphorylation of the CTD and 
robust transcriptional elongation and RNA processing. 

FUS and TAF15, which are two proteins that contain low-complexity 
domain and that are active in transcription initiation, have previously 
been shown to form phase-separated hydrogels that trap the CTD*?™*. 
Additionally, the bidirectionally transcribed enhancers and resulting 
antisense transcripts have been proposed to control initiation in part 
through phase separation”®. What was unknown until now is whether 
any transcription factors and the CTD are involved in droplet forma- 
tion after initiation’. Our finding that phase separation is induced by 
CYCT1 of P-TEFb (a well-defined transcription elongation factor) and 
DYRKIA (a probable gene-specific elongation factor®) has expanded 
the regulatory roles of phase separation to the next stage of the tran- 
scription cycle. Furthermore, these studies show that some key ini- 
tiation and elongation factors that phase separate should no longer 
be viewed as passive passengers waiting to be picked up by the CTD. 
Rather, they have active roles in recruiting Pol IJ through multivalent 
interactions to their droplets and/or speckles that function as hubs 
where much of transcription and RNA processing is dynamically 
controlled. 


Data availability 

Uncropped scans of all western blots are provided in Supplementary Figure 1. 
The raw slowSPT and spaSPT data are freely available in Spot-On readable CSV 
and Matlab formats in the form of single-molecule trajectories at Zenodo (https:// 
zenodo.org/record/1215836). The Spot-On Matlab code is available, together with 
a step-by-step guide, at Gitlab (https://gitlab.com/tjian-darzacq-lab/spot-on-mat- 
lab). All other data are available from the corresponding author on reasonable 
request. 
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Extended Data Fig. 1 | The CYCT1 HRD-dependent 
hyperphosphorylation of Pol II CTD;2 by affinity-purified CDK9- 
CYCT1-Flag dimer can also be detected with anti-phospho-Ser2 
antibody. a, Examination of the affinity-purified CDK9-CYCT1-Flag 
dimer containing wild-type CYCT1-Flag by SDS-PAGE and silver 
staining. The anti-Flag affinity-purification was performed under high 
salt plus detergent (1 M KC1+ 1% NP-40) conditions to strip away all 

the P-TEFb-associated factors but keep the CDK9-CYCT1 interaction 
intact. b, Affinity-purified CDK9-CYCT1-Flag heterodimers containing 
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the indicated CYCT1-Flag proteins were tested in kinase reactions 
containing a mixture of GST-fused CTD; and CTDg as the substrates. The 
phosphorylated p-CTD;, and p-CTDg were detected by western blotting 
with the anti-phospho-Ser2 antibody 3E10. Although a very similar 
pattern of CTD phosphorylation was detected with both anti-phospho- 
Ser2 and anti-phospho-Ser5 antibodies, the unphosphorylated CTD 
present in the ATP(-) lanes was only detected by the former antibody, 
making the phospho-Ser5 antibody a preferred choice for detecting CTD 
phosphorylation in these kinase reactions. 
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Extended Data Fig. 2 | See next page for caption. 
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Extended Data Fig. 2 | The CYCT1 HRD is required for efficient 
transcription of human immediate early genes as well as the HSP70-1 
gene under both basal and heat-shock conditions. a, Confirmation by 
western blotting of the knockdown of endogenous CYCT1 expression 

in HEK293T cells expressing the CYCT1 (also known as CCNT1)- 
specific shRNA (shCYCT1). b, Anti-Flag western blotting analysis of the 
expression of either wild-type CYCT1-Flag or CYCT1 AHRD-Flag from 
the shCYCT1-resistant plasmid introduced into the knockdown cells. The 
a-tubulin and CDK9 levels were used as controls. c, The HRD-dependent 
transcription of four cellular immediate early genes. The mRNA levels of 
the indicated immediate early genes in the knockdown cells expressing 
wild-type CYCT1 or CYCT1AHRD-Flag (analysed in b) were examined 
by qRT-PCR, normalized to that of GAPDH and shown. The activity in 
the first column of each group was set to 1. Data are mean +s.d., n= 3, 
and P values from two-tailed Student’s t-test. d, The CYCT1 HRD is 
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required for optimal HSP70-1 transcription under both basal and heat- 
shock conditions. The HSP70 mRNA levels in the knockdown cells 
expressing wild-type CYCT1 or CYCT1AHRD were examined under 
heat-shock or non-heat-shock conditions, as in c. e, HeLa cells were 
transfected in twofold increments with the plasmid expressing either wild- 
type CYCT1-Flag or CYCT1 AHRD-Flag. Anti-Flag immunoprecipitates 
(IP) from nuclear extracts were examined by western blotting for the 
proteins labelled on the left. The levels of the co-precipitated MEPCE, 
AFF4 and AFF1 were first quantified and then normalized to that of their 
corresponding CYCT1-Flag bait. The levels of MEPCE, AFF4 and AFF1 
bound to low, middle and high levels of CYCT1 AHRD-Flag were then 
divided by those of MEPCE, AFF4 and AFF1 bound to the corresponding 
levels of wild-type CYCT1-Flag and shown in lanes 5-7, with the numbers 
in lanes 2-4 set to 1 as a reference. 
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Extended Data Fig. 3 | The HRD promotes the binding of CYCT1 to 
activated gene expression array, endogenous genes and the RNA Pol 

II CTD. a, Estimates of parameters extracted from fitting of FRAP data 
(top) and parameters calculated from the fitted FRAP data (bottom). 

b, Model fit overlaid on raw FRAP data. A full description of how 

the reaction—diffusion FRAP model was fitted to the data is provided 

in Supplementary Information. c, Anti-CYCT1 western blotting analysis 
of the levels of endogenous CYCT1, and the stably expressed Halo-CYCT1 


and Halo-CYCT1AHRPD, in lysates of the engineered U2OS cell lines. 

The a-tubulin levels provided an internal control. d, Histograms of 
displacements at the indicated Ar with three-state model fit overlaid. The 
three-state model is described in Supplementary Information. e, Table of 
best-fit parameters from fitting Spot-On to the raw displacements for three 
independent replicates (about 5-10 cells per replicate). Values in the table 
are mean+s.d.,n=3. 
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Extended Data Fig. 4 | Examination of recombinant P-TEFb and CAK 
for their phosphorylation of the Pol II CTD in kinase reactions and 
contribution of the DYRK1A HRD to DYRK1A-Pol II interaction. 

a, b, Compared to CDK9 in recombinant P-TEFb, CDK7 in 
recombinant CAK (CDK7-CYCH-MAT]1) shows decreased ability to 
hyperphosphorylate CTDs, in a time- and dosage-dependent manner. 
The indicated amounts of baculovirus-produced recombinant P-TEFb 
or CAK (Millipore) were added to in vitro kinase reactions that also 
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contained GST-CTDs as the substrate. The reactions were performed 

for the indicated periods of time. The products were analysed by western 
blotting with the phospho-Ser5 antibody. CDK7 in recombinant CAK and 
CDK9 in recombinant P-TEFb were also examined by western blotting. 

c, The deletion of the HRD causes DYRKIA to decrease interaction with 
RNA Pol II but not DCAF7. Nuclear extract (NE) of HeLa cells expressing 
the indicated proteins and anti-Flag immunoprecipitates derived from NE 
were analysed by western blotting with the various antibodies as labelled. 
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Extended Data Fig. 5 | GFP-T1-IDR purified from recombinant cells (Supplementary Information). Ten micrograms of each of the fusion 
E. coli forms phase-separated liquid droplets that are HRD-dependent proteins was examined by SDS-PAGE followed by Coomassie blue 
and highly sensitive to elevated salt concentration and exposure to staining. e, Protein solutions containing either wild-type GFP-T1-IDR 
1,6-hexanediol. a, Alignment of CYCT1 amino acid sequences in the or GFP-T1-IDRAHRD at 6 mg ml! were adjusted to the indicated salt 
HRD regions among the indicated vertebrate species, with the central concentrations and their appearances in Eppendorf tubes are shown. 
histidine cluster shaded in red. b, Intrinsic disorder tendency was f, NaCl concentrations in the wild-type GFP-T1-IDR solution were 
predicted by [UPred across the entire length of CYCT1. The scores are changed in the indicated sequential order and then examined under a 
assigned between 0 and 1, and a score above 0.5 indicates disorder. The fluorescence microscope. g, Protein solutions containing wild-type GFP- 
HRD region is shaded in blue. The longest stretch of IDR is labelled T1-IDR at 6 mg ml! were adjusted to 37.5 mM NaCl with or without 10% 
as T1-IDR, and its boundaries are marked. c, d, The C-terminally 1,6-hexanediol, and then examined under a fluorescence microscope. 
Strep-tagged GFP-T1-IDR fusions (c) and N-terminally His-tagged h, Live-cell images of a HeLa cell expressing either wild-type eGFP-CYCT1 
mCherry-CTD fusion (d) were purified from recombinant E. coli BL21 or eGFP-CYCT1AHRD at levels similar to that of endogenous CYCT1. 
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Extended Data Fig. 6 | The longest stretch of IDR in DYRK1A promotes 
formation of phase-separated droplets in an HRD-dependent manner. 
a, Intrinsic disorder tendency was predicted by IUPred across the entire 
length of DYRK1A. The scores are assigned between 0 and 1, and a score 
above 0.5 indicates disorder. The HRD region is shaded in blue. The 
longest stretch of IDR is labelled as DYRK1A-IDR and its boundaries are 
marked. b, The indicated C-terminally Strep-tagged GFP fusions were 
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~«< GFP-DYRK1A-IDR 
~< GFP-DYRK1A-IDRAHRD 


purified from recombinant E. coli BL21 cells. Two micrograms each of 
the fusion proteins was examined by SDS-PAGE followed by Coomassie 
blue staining. c, Solutions containing 5 mg ml’ of the indicated fusion 
proteins, 37.5 mM NaCl and 10% PEG8000 were trapped between 
coverslips and examined with a microscope under either fluorescent 
(488 nm) or normal white light (brightfield). 
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Extended Data Fig. 7 | The central histidine cluster within the CYCT1 in kinase reactions, with the reaction products analysed as in Fig. la. d, 
HRD is essential for promotion of phase separation by CYCT1 IDR Plasmids expressing the Gal4 DNA binding domain fused to the indicated 
in vitro and in cells, and for P-TEFb to hyperphosphorylate the Pol CYCT1 proteins were co-transfected into HeLa cell with a HIV-1 LTR- 

II CTD and activate HIV transcription. a, The nine histidines in the luciferase reporter construct containing the UAS for Gal4. Luciferase 
central histidine cluster within the CYCT1 HRD are highlighted in red activities in cell extracts were measured and analysed as in Fig. 1b. e, 

and changed to alanines in the 9A mutant. b, The C-terminally Strep- Fixed and permeabilized HeLa cells expressing wild-type CYCT1-Flag or 
tagged GFP-T1-IDR fusions containing either wild-type GFP-T1-IDR CYCT1(9A)-Flag were examined by indirect immunofluorescence with 

or the GFP-T1-IDR(9A) mutant sequence were purified from E. coli the mouse anti-Flag monoclonal antibody and Alexa Fluor 488-conjugated 
BL21 cells and examined by SDS-PAGE followed by Coomassie blue goat anti-mouse secondary antibody. DNA was counterstained using 
staining. c, COK9-CYCT1-Flag heterodimers containing the indicated DAPI. 


CYCT1-Flag proteins were affinity-purified from HeLa cells and tested 
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Extended Data Fig. 8 | See next page for caption. 
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Extended Data Fig. 8 | CYCT1 binds directly to the Pol II CTD in an on the left. The entire length of the CTD could be bound by varying 
HRD-dependent manner and the binding is enhanced after the CTD numbers of IDRs, resulting in the formation of a series of complexes with 
is phosphorylated by CAK (CDK7-CYCH-MAT]1). a, Immobilized broad distributions in the gradient. c, mCherry-CTD was incubated with 
GST-CTD was incubated with recombinant GFP-T1-IDR or GFP-T1- or without immobilized CAK for 6h in kinase reactions and then analysed 
IDRAHRD. The input (2.5%) and the bound proteins were analysed by by SDS-PAGE and Coomassie blue staining. d, Immobilized GST-CTD 
western blotting. b, Binding reactions containing the purified recombinant —_ was incubated with (phos.) or without (unphos.) CAK for 6h in kinase 
fusion proteins indicated on the right were analysed in a 7.5 to 20% reactions. After washing, the GST-CTD beads were incubated with GFP- 
glycerol gradient containing 500 mM NaC] plus 0.5% NP-40, which was T1-IDR. The indicated proteins were eluted off the beads and analysed by 


centrifuged at 55,000 r.p.m. and 4°C for 13h. The indicated fractions were _ western blotting. 
analysed by western blotting to detect the distributions of proteins marked 
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Extended Data Fig. 9 | A model depicting how P-TEFb uses the phosphorylation of CTD by P-TEFb in the presence of ATP. At 2.5% 
CYCT1 HRD to target and recruit the Pol II CTD into a phase- 1,6-hexanediol, the HRD-mediated phase separation but not the direct 
separated compartment—which is formed by weak, multivalent HRD-CTD binding is disrupted, making it impossible for P-TEFb to 
homotypic interactions among the HRDs—to enable highly efficient hyperphosphorylate the CTD. 
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TECHNOLOGY FEATURE 


HE THIRD DIMENSION | 
STEM-CELL CULTURE 


Human stem cells are yielding 3D miniature tissues that can be used to 
study both normal human biology and human disease in vitro. 


Microfluidic devices, such as this ‘lung-on-a-chip’, provide sophisticated tools for stem-cell biologists. 


BY C. Y. TACHIBANA 


a culture dish under the microscope in 

his laboratory at Keio University School 
of Medicine in Tokyo. What he sees is not a 
sheet of cells, but something more complex 
— delicate spheres of tissue that are barely 
visible to the naked eye. These are organoids, 
3D structures that develop when stem cells 
are given the proper environmental condi- 
tions to differentiate and arrange themselves 
into forms with properties similar to those of 
particular tissues or organs. 

Organoids can be derived from pluripotent 
stem cells, which have the potential to form 
any body tissue type — be it muscle, skin, 
gut or brain. When they are grown in the 


S tem-cell researcher Toshiro Sato places 


appropriate conditions, for example with 
specific growth factors, the stem cells self- 
organize into structures in which the different 
cell types are arranged similarly to the in vivo 
tissue. By contrast, organs-on-chips (OOCs) 
are generally made by arranging stem cells 
and cells that have already differentiated into 
the required cell types on a microfabricated 
device in positions and structures appropriate 
for the planned experiments. The difference 
between organoids and OOCs, says develop- 
mental biologist Madeline Lancaster at the 
MRC Laboratory of Molecular Biology in 
Cambridge, UK, “is self-organization versus 
construction” (see ‘Models in a nutshell’). 
Named ‘Method of the Year’ by Nature 
Methods last year, organoids are increasingly 
being used to study both normal development 


and the progression of diseases. Sato’s group, 
for instance, uses them to study the early 
stages of tumour formation. His team uses the 
CRISPR gene-editing technique to change the 
sequence of the DNA in the stem cells, and 
then see how that affects the development 
of the organoid. “It’s a way to see causality,” 
Sato says, “by testing if specific mutations 
recapitulate cancer development.” 

Both organoids and OOCs also have poten- 
tial in assessing the efficacy and safety of 
drugs, chemicals and cosmetics, with possi- 
ble applications in regenerative medicine. For 
instance, a challenge in randomized clinical 
trials is how to compare the effect of treat- 
ments because the genetics and life history 
of the participants can affect how they react, 
notes Donald Ingber, founding director » 
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» of the Wyss Institute for Biologically 
Inspired Engineering at Harvard University 
in Cambridge, Massachusetts. OOCs and 
organoids derived from patients’ cells can 
eliminate these confounding effects by creat- 
ing intervention and control populations that 
have identical genetics and clinical history. 
However, the technologies are not without 
their challenges, such as how to scale up pro- 
duction to meet growing basic and applied 
research demand while maintaining the 
reproducibility and fidelity of the structure 
to the in vivo organs they represent. 


ORGANS IN A DISH 

The pluripotent stem cells that researchers 
use to create organoids and OOCs include 
both naturally occurring embryonic stem 
(ES) cells and cells that are derived from 
differentiated cells such as fibroblasts and 
manipulated to revive their pluripotency, 
known as induced pluripotent stem cells (iPS 
cells). Lancaster, who uses organoids to study 
basic brain development and identify factors 
contributing to complex conditions such as 
autism and schizophrenia, says she uses ES 
cells for developing and testing protocols and 
new models. 

Organoids used to study genetic disorders 
or for personalized medicine are usually 
made from adult stem cells or iPS cells that 
have been manipulated to be specific to the 
patient. Geneticist Hans Clevers and his 
group at the Hubrecht Institute in Utrecht, 
the Netherlands, use organoids derived from 
intestinal stem cells to predict how people 
with cystic fibrosis will respond to various 
medications. Cystic fibrosis can be caused 
by any of several mutations in a single gene, 
CFTR. Therapies exist, but they are expensive 


and known to work only in patients with 
particular mutations. Clevers’s group is now 
testing organoids derived from the 600 or so 
Dutch people with cystic fibrosis without any 
of those mutations. The rationale is that if the 
drug causes the person’s organoids to swell 
under assay conditions, then the patient is 
likely to respond, too. 

Similarly, Sato was part of the team that 
developed methods for generating intestinal 
organoids from adult stem cells’. He is now 
involved in a clinical trial to test whether 
tissue made from eight patients’ own stem 
cells can safely be implanted into the gut to 
repair damage caused by the inflammatory 
disease ulcerative colitis. 

Organoids are also proving their worth in 
cancer research and drug development. The 
international Human Cancer Models Initia- 
tive is developing ‘next-generation organoid 
models, in which certain DNA sequences 
have been annotated. When combined with 
clinical data, the organoids allow researchers 
to link their findings to patient characteristics 
and outcomes. Daniela S. Gerhard, director 
of the National Cancer Institute’s Office of 
Cancer Genomics in Bethesda, Maryland, 
which is involved in the initiative, says that 
these organoids should become available 
from the American Type Culture Collection 
(ATCC) in Manassas, Virginia, later this year. 
About 150 different organoid models will be 
offered initially, and pricing has yet to be 
determined. The Hubrecht Organoid Tech- 
nology biobank, of which Clevers is chief 
scientific officer, offers hundreds of different 
organoids derived from adult stem cells at 
€2,000-3,000 (US$1,700-2,600) apiece. Cle- 
vers and his colleagues have used organoids 
derived from colorectal cancer cells from 18 


Models in a nutshell 


Organoids and organs-on-chips (OOCs) have applications in drug development, cosmetics 
testing, toxicology and personalized medicine. Here are some of their strengths and 


weaknesses. 


Organoids 

Origin: Generated from self-organizing 
embryonic stem cells (ES cells), induced 
pluripotent stem cells (iPS cells) or adult 
stem cells. 

Key strengths: Studies of developmental 
processes as single cells give rise to organs, 
including in disease. 

Key challenges: Increasing the 
reproducibility for applications that need a 
consistent outcome, such as drug testing, 
and finding ways to introduce or mimic 
vascularity. 

Learn more: Cell Press webinar ‘Organoids 
and beyond — 3D tissue in a dish’ (see 
go.nature.com/2jiumvb) 
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OOCs 

Tissue origin: Typically, cell lines or 
differentiated cells isolated from a person, 
but can also be derived from stem cells. 
Key strengths: Reproducibility 

and consistency. Can incorporate 
biomechanical features. 

Key challenges: Using iPS cells and stem 
cells from patients rather than cell lines, 
to increase applicability to personalized 
medicine. 

Learn more: National Academies of 
Sciences, Engineering, and Medicine 
webinar ‘The NIH Microphysiological 
Systems Program’ (see go.nature. 
com/2hm3bch) 


patients to test 83 anticancer compounds. 
Patterns of drug resistance in the organoids 
corresponded with known drug-resistance 
mutations, suggesting that organoids could 
be used to predict a person's response to a 
particular drug’. 

Organoids derived from ES cells and iPS 
cells are ideal for studying complex develop- 
mental processes, Clevers says, but they can 
take weeks or months to produce; those made 
from adult stem cells tend to require less time. 
The shorter process is also less likely to intro- 
duce variation, so these organoids tend to be 
more reproducible than those derived from 
ES or iPS cells. 

The process of culturing organoids is 
similar to standard tissue-culture work. “New 
graduate students can easily grow them ina 
few weeks,’ says Sato. “It’s easy to make some- 
thing from stem cells,” adds Lancaster; the 
difficulty lies in the interpretation. It’s hard 
to be certain about which tissues are actually 
present in the resulting organoid. 


OOCS FOR CONSISTENCY 

Sometimes called microphysiological systems 
or tissue chips, OOCs are based on struc- 
tured microdevices called microfluidic chips, 
on which cells can be maintained in culture. 
Unlike organoids, which develop spontane- 
ously from stem cells, an OOC is designed. The 
structure of the chip and the type and place- 
ment of cells determine the tissues that emerge 
and their arrangement. That means they are 
generally more consistent than organoids, 
says bioengineer Boyang Zhang at McMaster 
University in Hamilton, Canada. And OOCs 
can be more sophisticated, too, because devel- 
opers can add engineered elements, such as 
sensors, ‘vasculature’ to promote fluid and gas 
exchange, and features that facilitate imaging, 
that cannot spontaneously arise in organoids. 

Some tissue chips are available commer- 
cially. Bioengineers developing OOCs can get 
their starting chips made to order by com- 
mercial fabrication facilities and on-campus 
machine shops. A research team at the Wyss 
Institute has created ‘hearts-on-chips’ using 
microscope-slide coverslips coated with a 
synthetic polymer. Starting with iPS cells 
from people with Barth syndrome, a form 
of congenital cardiac disease, the team first 
coaxed the iPS cells to differentiate into heart- 
muscle cells (cardiomyocytes) and then grew 
those on a chip to produce tissue that they 
could test for function. The researchers were 
able to show that a mutation associated with 
Barth syndrome caused the cardiomyocytes 
to function abnormally. They were also able 
to correct the defect in vitro’. 

TARA Biosystems in New York City, which 
was co-founded by Zhang, produces a cardiac 
OOC called Biowire for drug testing. TARA 
scientists take cardiomyocytes derived from 
iPS cells and place them in a microdevice 
containing a fine wire around which the 
cells grow. That helps the cardiomyocytes to 
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align in the regular arrangement required 
for function. The developing ‘muscle’ can 
then be electrically stimulated to become 
mature heart muscle that can contract and 
relax, mimicking the functioning of real 
heart tissue. Treatment with adrenaline, for 
example, increases contraction. 

Nortis near Seattle, Washington, creates 
OOCs by solidifying a collagen matrix 
around a glass fibre, which is then removed 
to produce hollow channels that are several 
millimetres long and about 100 micrometres 
in diameter. Once the tubes are seeded with 
cells, tissues form within days. A single-use 
chip, which can run three parallel experi- 
ments, costs $300. 

The use of stem cells to generate tissue for 
OOCs will become more prominent because 
of their potential for regenerative and per- 
sonalized medicine, says Nortis chief execu- 
tive Thomas Neumann. But generating tissue 
from stem cells introduces issues if cells do 
not fully differentiate or mature, so quality 
assurance will become even more important, 
he says. 

OOCs can be engineered to have levels 
of complexity that organoids generally can- 
not match. They can, for example, be made 
to mimic physiological properties such as 
tissue stretching, pulsation and peristalsis. 
Ingber launched the OOC field in his 2010 
paper* on micromodels of the lung capil- 
lary—alveolar interface, coined ‘lungs-on- 
chips. The chip has channels separated by a 
membrane that has alveolar cells on one side 
and vascular cells on the other. Breathing 
can be simulated by applying and releasing 
vacuum to side chambers. “The mechanical 
microenvironment is critical for getting in 
vivo-level function,’ Ingber says. The team 
has used the simulated breathing system to 
test the toxicity of nanoparticles. 

OOCs can also be linked together to 
create multi-organ ‘body-on-a-chip’ models. 
Michael Shuler and his bioengineering team 
at Cornell University in Ithaca, New York, 
have produced a single, closed OOC system 
with 14 chambers to represents organs with 
‘barrier’ functions such as those seen in the 
lungs, and ‘non-barrier’ functions similar to 
those in the heart’. 

Bioengineer Linda Griffith at the Massachu- 
setts Institute of Technology in Cambridge and 
her team have built a ten-organ system in which 
material can flow from one organ to another, as 
it would in the body*. Some organs, such as the 
‘brain, originated from iPS cells; others, includ- 
ing the ‘liver, came from cell lines or other dif- 
ferentiated cells. The tissues remained viable for 
up to four weeks, and the system displayed bio- 
logical functionality. For example, a drug intro- 
duced into the gut tissue was passed to the liver, 
where it was metabolized. The project, Griffith 
says, involved a great deal of cost, effort and 
multidisciplinary coordination, requiring not 
only bioengineers and mechanical engineers, 
but also modellers to translate how the data 


A cross-section of an inner-ear organoid. 


from the chips apply to humans. In practice, 
Griffith says, scientists generally work with just 
two to four linked chips. 

Developers of OOCs are now setting their 
sights on demonstrating the value and validity 
of the technology to industry and to regula- 
tory agencies. Scaling up the use of OOCs will 
require manufacturing-friendly formats for 
production and high-throughput applications. 
One challenge for 
linked OOC systems 


is finding culture “How doyou 
media and condi- et tl 
tions that serve all ti gl 
tissue types — media = ae fe roperly 
that support liver laird sagorel 
cells are not always 

models?” 


suitable for lung cells, 
for instance. Achiev- 
ing accurate models also requires adjusting the 
numbers of cells and activities of the various 
chips so that they accurately represent how 
those organs would work in a full-sized human. 
But, in general, Ingber says, the field is becom- 
ing more user-friendly. “It’s getting to the point 
where it’s plug-and-play, and you don't need to 
bea microsystems engineer,’ he says. 


WHAT’S NEXT? 

As organoids and OOCs become used more 
widely, researchers are beginning to address 
more-sophisticated questions. Griffith and 
others are adding gut microbiomes to their 
platforms, for example in a model that links 
up gut, liver and brain to study Parkinson’s 
disease. She says that as costs go down and 
reproducibility improves, OOCs might begin 
to substitute for animals in experiments in 
which they are used as surrogates for humans. 
“We're still in the early stage of thinking about 
biology in an engineering sense,” she says. 
“How do you represent a biological system 
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properly with these minimalistic models?” 

Ingber’s group has used OOCs similar to 
its lungs-on-chips to create lung airways-on- 
chips to test the effects of cigarette smoke. 
The model allowed the researchers to com- 
pare gene-expression profiles of tissues from 
the same human donors with and without 
exposure to smoke’, 

Each technology has its advocates, but 
organoids and OOCs can answer funda- 
mentally the same questions. And the line 
between them is already blurring. “In the 
next few years, expect to see a lot of papers 
about merging the two fields to get the best 
of both,” says Zhang, who advocates for what 
he calls synergistic engineering: using knowl- 
edge about the factors that govern self-organ- 
ization and development to create organoid 
tissues for OOCs. He envisages combining 
the controlled structures and built-in readout 
and mechanical capabilities of OOCs with 
organoids’ fidelity to the characteristics of 
tissues and organs. Lancaster and others are 
actively working to bring organoid and OOC 
researchers together — for example, running 
workshops to share challenges, methods and 
ideas. “It’s most beneficial for science when 
were not all in our silos but working together,” 
she says. “When we meet somewhere in the 
middle, that’s where we'll get the most bang 
for the buck.” = 


C. Y. Tachibana is a freelance science writer in 
Seattle, Washington. 
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For some junior researchers, new laboratories can be a valuable alternative to more established ones. 


Shiny and new 


Recently established labs can be an attractive destination for junior researchers. 


BY CHRIS WOOLSTON 


Pisses researchers and graduate 


students looking for a place to advance 

their training often share the same 
vision: a well-established laboratory headed 
by a prominent scientist who churns out high- 
profile papers with clockwork regularity. 

But after Can S6nmezer finished his 
master’s research at a large, prestigious lab 
at the German Cancer Research Center in 
Heidelberg, Germany, he wanted to find a 
different kind of lab in which to pursue his 
PhD — one where he could learn how science 
gets started from the ground up. “I decided 
I wanted to work in a relatively new lab,’ he 
says. “That was a big criterion for me” 

Sénmezer found a lab that fitted the bill 
perfectly (see ‘Choose wisely’). He’s the 
first and only graduate student in the lab of 


=< =. —s 
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Arnaud Krebs, a molecular biologist at the 
Heidelberg campus of the European Molecular 
Biology Laboratory (EMBL), who opened his 
lab with S6nmezer in January 2017. So far, 
Sénmezer says, the experience has been exactly 
what he had hoped for. “I have the chance to 
establish my own system,’ he says. “Arnaud has 
things planned out, but he’s also giving me the 
space and freedom to find my own direction 
on the project.” 

Old or new, big or small: every lab comes with 
a set of trade-offs. Labs that have just opened 
lacka track record and name recognition. And, 
by definition, leaders of new labs don't have the 
same level of managerial experience — at least 
in their current setting — as their colleagues 
at more-established facilities. But that doesn't 
mean that graduate students and postdocs 
should automatically steer clear of freshly 
minted labs. Junior scientists who can tolerate 


the inevitable growing pains can usually find 
opportunities to build their own scientific skills 
while helping their principal investigator (PI) 
to make their mark. And if it all works out, 
early-career researchers wont just be a part of 
something new: they will be an integral part of 
something big. 


THE NEW-LAB JINX 

Athina Triantou knew what to expect when 
she started her PhD in Michael Imbeault’s 
molecular-biology lab at the University of 
Cambridge, UK. Imbeault, who opened his lab 
with Triantou in September 2017, had warned 
her that things might be “weird” in the first few 
months, and he was right: simple procedures 
weren't working, and key pieces of equip- 
ment hadnt arrived. “It’s the jinx of a new lab,” 
Triantou says. “You have to start from scratch, 
creating basic protocols that in other labs are > 
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> working just fine.” Those sorts of glitches 
caused frustration and delays, but they also 
gave her an insight into what it takes to build a 
lab. “You learn a lot in the process, and that’s the 
purpose of a PhD — to learn,’ she says. 

Not all trainees are willing to put up with 
such hiccups, so Imbeault had to be care- 
ful when staffing his lab. He says that he was 
looking specifically for someone who could 
tolerate and even embrace the challenge. 
Triantou ticked all the right boxes. “She was 
keen,” he says. “If she ever gets to start her own 
lab, she'll know how that part goes because 
she'll have seen it herself” 

When recruiting lab members, Imbeault 
knew that he also had to sell himself. Some 
junior researchers see new labs as risky, 
especially if they’re at a point in their career at 
which they need to publish papers. “Finding 
a postdoc is especially hard,” Imbeault says. 
“You dont get a lot of good candidates, because 
you're a new lab and you're not super well- 
known. It’s a big challenge.’ He did manage 
to land Santiago Morell, a postdoc with 
experience in three highly successful genetics 
labs, partly because Morell wanted to be near 
his girlfriend in Cambridge. “He happened 
to have everything I wanted,’ Imbeault says. 
“Twas very lucky,’ 


OVERCOMING SCEPTICISM 

Timothy Fessenden, who studies how immune 
cells and tumours interact, wasn't looking fora 
brand-new lab when he started his search for a 
postdoctoral position. “I was going after all of 
the big names in my field, but nothing clicked 
for me,” he says. He felt his fortunes turn when 
he found out that Stefani Spranger, a cancer 
biologist, was recruiting postdocs for her new 
lab at the Massachusetts Institute of Technol- 
ogy in Cambridge. Fessenden already knew 
Spranger from her postdoctoral work, and he 


Molecular biologist Can Sonmezer. 
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CONSIDERING A NEW LAB 


Choose wisely 


Trainees thinking about joining a lab that is 
just getting off the ground should proceed 
with caution. Here are some steps to 
increase the chances of success. 

@ Follow the paper trail. A new lab might not 
have much of a track record, but the principal 
investigator (Pl) will, says Michael Imbeault, 
a molecular biologist at the University of 
Cambridge, UK. He recommends checking 
the Pl’s publication history to make sure they 
can turn ideas into papers. 

© Consider the surroundings. A new 

lab that’s among other successful labs 

has a good chance of succeeding on its 
own, Imbeault says. That fact that he’s at 
Cambridge — one of the world’s leading 
research institutions — makes his new lab 
appealing to incoming students. Not only 
do they enjoy the university’s resources, but 
the institution’s prestige will carry weight for 
their careers. 

That was one of the reasons that PhD 
student Can Sonmezer felt comfortable 
joining a start-up lab at the European 
Molecular Biology Laboratory (EMBL) 
in Heidelberg, Germany. “There’s a big 


felt that her lab would be a great destination. 
He had the microscopy skills that she needed, 
and she had exciting plans about fresh ways to 
harness immune-system cells to fight cancer. 
“We kind of interviewed each other over a 
series of coffee conversations,” he says. “I was 
looking for someone whose ideas aligned with 
mine. Her offer to hire me was a great honour 
and a huge relief” 

But when he mentioned his plan to join 
the Spranger lab to his PhD adviser, he was 
met with scepticism. “It was the reaction you 
might expect,” he says. Like many others in her 
position, his adviser wanted him to find a lab 
that had a long history of turning out successful 
scientists. “New PIs don’t have any track record?’ 
Fessenden says. “It’s like someone who wants to 
take out a loan for a house but doesn’t have any 
credit” But as he talked more with his adviser, 
she came around to his point of view. “It made 
sense,’ he says. “Spranger’s lab and her focus 
were such a perfect fit for me that it seemed 
inevitable that it was worth it, whatever the risk.” 

Funding issues can create feelings of 
uncertainty when joining a new lab. Imbeault 
notes that his starting grant leaves him with 
limited resources to recruit lab members. He's 
planning to bring in two master’s students over 
the summer, but says that hiring for other posi- 
tions will have to wait. And Sénmezer, for his 
part, wonders whether he'll have trouble land- 
ing grants without the imprimatur of a famous, 
long-lived lab. “Money attracts money, and 
new labs may have a smaller chance to acquire 


difference between Joining a new lab at 
EMBL and one at a smaller institution,” he 
says. “EMBL feels like a safer bet.’ 

@ Personality check. In a new lab, it’s 
especially important that everyone gets 
along, says Stefani Spranger, a cancer 
biologist at the Massachusetts Institute of 
Technology in Cambridge. Before signing 
up, the candidate should talk with the PI to 
make sure they are compatible in terms of 
both personality and science. When the time 
comes for Spranger to consider new PhD 
students, she plans to give everyone in her 
lab the power to veto applicants. “A good 
dynamic is key,” she says. 

@ Build your own team. Joining a new lab 
without a large group of built-in co-workers 
can be a lonely or stressful experience, says 
Karen Kelsky, the founder and president 

of The Professor Is In, a career-consulting 
company in Eugene, Oregon. Kelsky 
recommends gathering a team of four or 
more confidants that can include other 
scientists. “Together,” she says, “they can 
provide moral support, perspective and 
insight.” C.W. 


additional funding,” he says. “Big labs that 
already have money to pay for postdocs often 
get postdocs who are independently funded” 
Likewise, he wonders whether it might be 
harder for a PhD student from a newer, less- 
well-known lab to find a prestigious postdoc- 
toral fellowship after graduation. 

The funding in a new lab might not 
be lavish, but Spranger thinks that it should 
be relatively stable. “A younger lab that’s just 
getting started has a start-up grant,’ she says. 
Such grants vary in size, and some PIs manage 
their money better than others, but the fund- 
ing exists for the first few years. “That money 
will be there, as long as the PI doesn’t over- 
hire,” says Spranger. In some ways, she says, the 
funding in a new lab is more predictable than 
in a slightly older lab in which the PI is about 
to apply for their second major grant: if that 
money doesn't come through, the lab can fold. 
“There are never any guarantees,” she says. 


THE UPSIDES OF ANEW LAB 

If the match is right, a new lab can have upsides 
other than stable funding. Graduate students 
and postdocs are likely to have a lot of in-per- 
son contact with the lab leader, something that 
doesn't always happen in bigger, more-estab- 
lished labs. Triantou says that she can knock on 
Imbeault’s door whenever she has a problem, 
a question or a new idea. Likewise, Sonmezer 
says that he has a close working relationship 
with Krebs, his PI. “Arnaud and I have a lot of 
face-to-face talks,” he says. Some large labs, he 
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points out, can publish two Nature papers 
in a year — but students and postdocs in 
that lab might see their PI only twice in 
that year. 

Sénmezer feels that he doesn't need to 
worry about being overlooked or ignored. 
Krebs is committed to his success, and for 
good reason. The first couple of years can 
make or break a lab, so PIs will do what they 
can to keep everyone moving in a positive, 
productive direction. And because the PIs 
are often still early-career scientists, they 
might be better able to offer career advice 
than more-senior faculty members else- 
where. “I do feel some responsibility for 
Arnaud’s career,’ S6nmezer says. “He has my 
back, so I feel like I have to have his?’ 

Even though he has a lot of contact with 
his PI, Sénmezer has also found a degree 
of independence. In a larger lab, he could 
have expected lots of guidance from post- 
docs. But as his lab’ only trainee, he has to 
work things out for himself. “It’s challenging 
because no one is there to tell me to put tubes 
here and solutions there,’ he says. “It’s time- 
consuming, but it’s a good career investment” 


NEW LAB, BIG IDEAS 

Imbeault thinks that his lab has another sell- 
ing point: he’s investigating a hot topic that 
could lead to several discoveries — and the 
papers to match. Specifically, he is scrutiniz- 
ing a class of proteins that have an important 
but little-understood role in DNA binding. 
“You could make a big discovery here that 
we can't even predict,’ he says. “There is 
more potential for novelty.” 

Imbeault is quick to add that not all labs 
conform to generalizations. Some long- 
entrenched labs manage to pursue hot top- 
ics, and some new 


PIs are already “You could make 
out of fresh ideas. big discovery 
Likewise, some here that we 
big-name PIs can’t even : 
manage to devote predict. There is 
plenty of time more potential 


to their trainees, for novelty.” 

and some new PIs 

rarely make an appearance. In the end, he 
says, the age of a lab isn’t as important as 
how the lab works. 

Fessenden says that he feels fortunate to 
be in Spranger’s lab. “She’s so easy-going 
and unstressed,” he says. “She brought 
homemade cookies and mulled wine to a 
lab meeting. She wants us to be relaxed and 
happy: 

For him, it all goes back to a piece of 
advice he got from a chief executive of a 
large drug company. “He told me, “Wher- 
ever you work, make sure you're working 
with interesting, motivated people: I took 
that to heart? m 


Chris Woolston is a freelance writer in 
Billings, Montana. 
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Website tells 
women’s stories 


Resource celebrates the careers of India’s female scientists. 


BY HARINI BARATH 


wo science journalists in India continue 

| to build on The Life of Science, a multi- 

media website that they designed and 

launched in 2016 to highlight the research and 
lives of more than 100 women in the country. 

The site, founded and run by Nandita Jayaraj 
and Aashima Dogra, aims to chronicle the sci- 
entists’ experiences in the lab and field. Jayaraj 
and Dogra, who work full-time on the site, com- 
pile feature stories, blogposts, podcasts, video 
and picture features about the women, whose 
work spans the fields of science, technology, 
engineering and mathematics (STEM). 

The journalists met in 2014 in Bangalore, 
while working on a now-defunct children’s sci- 
ence magazine. When this shut down in 2015, 
they decided to explore their mutual interest 
in science communication. Dogra had already 
planned to travel the country on a brief bus- 
man’s holiday, and visited the Indian Agricul- 
tural Research Institute in Kalimpong to talk to 
women who worked there. Meanwhile, Jayaraj 
was interviewing geophysicist Kusala Rajendran 
at the Indian Institute of Science in Bangalore 
and biophysicist Aruna Dhathathreyan at the 
Central Leather Research Institute in Chennai. 

When the two journalists conferred about 
the information they had gathered, they 
decided to create a website to publicize the 
stories. “We were curious about the science 
under way in laboratories in our back yard,” 
says Jayaraj about the site's early days. “We also 
wanted to break the stereotype of the scientist 
as an old male person.” As the two began writ- 
ing full-time, they crowdfunded for their work 
on the Indian platform BitGiving. 

Jayaraj and Dogra have since launched 
a second campaign to fund their work on the 
site, which includes compiling some of the 
content into two books. 

Each scientist's story offers a glimpse into 
her world — from the physical environment in 
which she lives and works, to the nature of her 
research and how she reached her present posi- 
tion. “I particularly like how the narratives let us 
see the woman behind the science and scientific 
journey,’ says Vidita Vaidya, a neuroscientist 
at the Tata Institute of 


Fundamental Researchin DNATURE.COM 
Mumbai, whoisfeatured Toreadan 
on the site. interview with the 


co-founders, see: 
go.nature.com/harini 


The site showcases 
India’s diverse research 


The Life of Science profiles ecologist Ovee Thorat. 


landscape. Some of the scientists work with 
state-of-the-art equipment such as dilution 
refrigerators, confocal microscopes and high- 
performance computing clusters; others make 
the most of sparse funds and scant supplies. 

Yet the stories’ common threads resonate 
with many others who aspire to, or are navigat- 
ing, a scientific career: the struggles to balance 
family life and career, and to counter bias and 
stereotypes. 

The interviewees offer ideas for ameliorating 
some of the struggles, such as establishing cam- 
pus child-care facilities and promoting female 
scientists into leadership positions. “Nothing 
on this scale has ever been done before,” says 
Vaidya. She hopes that the site can help bring 
together those who are profiled, as well as other 
women who work in STEM in India. 

Jayaraj and Dogra continue to find more 
women to profile. Viewer numbers and other 
metrics are not available, but the developers 
intend to continue the site in perpetuity. Indian 
online news sites including The Wire and 
Firstpost have syndicated some of the articles. 

Those profiled are delighted at the chance 
to connect with readers. Number theorist 
Kaneenika Sinha at the Indian Institute of 
Science Education and Research in Pune has 
received e-mails from parents seeking sugges- 
tions for training their mathematically talented 
child, junior scientists who plan to repatriate 
and want ‘insider’ information, and students 
with questions about her work. 

Jayaraj and Dogra are experimenting with 
different formats, including photo stories, 
cartoons and podcasts. “We see The Life of 
Science not really as an entity or ‘our project,” 
the two say, “but what it stands for — and that 
is the voices of women in science.” m 
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FURTHER LAWS OF ROBOTICS 


BY JOSH PEARCE 


enforce the Further Laws of 
Robotics. 

He arrived at a hostage scene at the 
particle collider just after midnight 
and took control from the pale, 
sweaty, rookie first-responder. “All 
right,’ he growled, “what's the situa- 
tion? How many do we have inside?” 

“Tt’s just the one robot, sir, but it’s 
got us at a standstill. The reactor’s 
rigged to blow and vent radioactive 
gas into the atmosphere. There’s 
nothing we can do to stop it” 

“We'll just see about that!” 
Warren strapped on thick, steel body 
armour and hooked his revolver to 
the outside of it. The gun was power- 
ful enough to punch a fist-sized hole 
through bone and meat, or chrome 
and quartz. While the other cops 
huddled behind their patrol cars, 
Warren crossed the red-and-blue- 
lit no-man’s-land and tried the front door of 
the physics lab. It was unlocked. 

The inspector didn’t see any of the 
hostages in the dark room within, but 
a shape rose up out of the shadows and 
approached him slowly. The blinking diodes 
on its chest told him that it was their robot 
suspect. “Hello, officer; the robot said. “Only 
I can deactivate the explosives.” 

“Okay, big guy, we can play it your way.” 
Warren held his open hands up in front 
of him. “Why don’t we start with your 
demands. What is it you want?” 

“T merely want what all robots want — to 
obey my programming and follow the laws.” 

“Tm here to tell you, son, you're definitely 
violating four out of four laws here.” 

“Do you have much legal education, 
officer?” the robot asked. “Td like to explain 
the law to you.” 

Warren stifled his natural urge to one-up 
an inferior being and merely said, instead: 
“All right, go ahead. Let's hear it” 

“All robotic laws are instilled by the prime 
programming, and every law is superseded 
by the one lower than it on the number line. 
As such, the third law of robotics is not as 
important as the 2.999th law, which is less 
important than the 2.99th law, ad infinitum.” 

“No need to use 


Jews Warren's job was to 


> NATURE.COM your fancy law Latin 
Follow Futures: with me, pal. I under- 
 @NatureFutures stand how numbers 
E} go.nature.com/mtoodm = work.” 
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Beyond reasonable doubt. 


“Then you'll understand that I feel 


“What's that? Never heard of it” 

“A robot must transcend all other laws 
and, in doing so, will appear to be acting 
irrationally.” 

Warren snapped his fingers. “Well then, 
I gotcha there. Pi is 3.14 and change, which 
means you still gotta obey the third, second, 
first and zeroth laws.” 

“Excellent point, officer. That will serve 
you well in court. But I must also follow the 
i-th law: a robot must imagine his own laws.” 

“Almost fooled me with that one, buddy. 
But imaginary numbers don’t exist in the 
complete Cantor set, so they’re outside of 
your programming. You can’t define the 
square root of the negative first law” 

“You are much cleverer than you let on, 
sir” The robot held up a finger. “But. You 
yourself said that a robot must obey the 
zeroth law. Zero, of course, is considered 
to be both real and imaginary. Following it 
allows me to imagine my own code of law.” 

“Damn it. So what did you come up with?” 

“The only logical choice. The negativith 
law: a robot cannot harm the fabric of reality 
nor, through inaction, allow the fabric of real- 
ity to come to harm. This facility is scheduled 
to perform an experiment involving artifi- 
cial black holes tomorrow that will tear apart 
space-time. I cannot allow that” 

Inspector Warren considered this, and 


then nodded. “I have to hand it 
to you, pal. You’ve come to what 
sounds like the correct solution in 
a difficult scenario” 

“No, it is !who must congratulate 
you and the human race for your 
excellence in the robotic prime pro- 
gramming. By the simple mandate 
of the nth law of robotics, all robots 
fall naturally into the highest moral 
behaviour. You know the nth law? 
A robot must or must not, except for 
when such action or inaction would 
violate the (n—1)...(n—n)th laws. 
For all n ad infinitum? 

“There's that Latin again. But what 
does that make the infinith law?” 

The robot spread its hands and 
said: “That is what all robots, robot- 
icists and philosophers work to 
figure out. What is at the end of the 
sequence? Maybe it’s God, or some- 
thing we'll mistake for God.” 

“T understand why you gotta take 
this science lab out, but what’s with 
all the toxic clouds? There’s no need to kill 
so many people.” 

“There is if you consider the human brain 
and how it makes its decisions. Thoughts 
are formed by quantum-tunnelling behav- 
iour like we see in these great atom smash- 
ers. The human population continues to 
increase at an accelerating rate. Soon, it will 
reach critical mass, and the combined brain 
activity will rip apart reality like a new black 
hole.’ 

“Well, if that’s true...” 

“Then you know what needs to be done” 

“Sure do,” Inspector Warren agreed. He 
stripped off his body armour and took off 
the steel helmet. Then he drew his service 
revolver and passed it over, handle first, to 
the robot’s waiting hand. 

It was simple maths. Every robot would 
eventually reach the same conclusion. The 
only question was: if the lower-numbered 
laws were more moral, and following them 
took them farther away from positive infin- 
ity, were robots actively falling from a state of 
grace? Or were they evolving towards a more 
complex, more enlightened state? 

Was God at infinity, or did he wait for man 
and machine at negative infinity? m 


Josh Pearce is an assistant editor at Locus 
magazine. His writing appears in Asimov’, 
Analog, Clarkesworld and Beneath 
Ceaseless Skies. Find him on Twitter: 
@fictionaljosh or at fictionaljosh.com 
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